Abstract

The data processing inequality states that the quantum relative entropy between two states $\rho$
and $\sigma$ can never increase by applying the same quantum channel $\mathcal{N}$ to both states. This inequality
can be strengthened with a remainder term in the form of a distance between $\rho$ and the closest
recovered state $(\mathcal{R}\circ\mathcal{N})(\rho)$, where $\mathcal{R}$ is a recovery map with the property that
$\sigma = (\mathcal{R}\circ\mathcal{N})(\sigma)$. We show the existence of an explicit recovery map that is universal
in the sense that it depends only on $\sigma$ and the quantum channel $\mathcal{N}$ to be reversed. This result
gives an alternate, information-theoretic characterization of the conditions for approximate quantum
error correction.


1 Introduction 

Suppose that an experiment, depending on an unknown parameter $\theta \in \Theta$, is described by classical
probability distributions $\nu_\theta$ on a sample space $(\mathcal{X}, \Omega)$. A statistic, i.e., a measurable map
$\mathcal{N} : (\mathcal{X}, \Omega) \to (\mathcal{Y}, \Sigma)$, is called sufficient with respect to $\Theta$ [12] if the conditional
probability does not depend on $\theta$, i.e., if there exists a probability distribution $\nu$ on $(\mathcal{X}, \Omega)$ such that

\[ \nu_\theta(X|\mathcal{N}(X)) = \nu(X|\mathcal{N}(X)) \quad \text{for all } \theta \in \Theta . \tag{1} \]

In such a case, $\mathcal{N}(X)$ contains the same information about the parameter $\theta \in \Theta$ as $X$ does.

This concept has been generalized to the quantum setup [34, 35, 31, 21]. For two Hilbert spaces
$A$ and $B$, let $S(A)$ denote the set of density operators on $A$ and let $\mathrm{TPCP}(A, B)$ be the set of
trace-preserving completely positive maps from $A$ to $B$. Let $Q(A)$ denote some subset of $S(A)$. A quantum
channel $\mathcal{N} \in \mathrm{TPCP}(A, B)$ is called sufficient (or reversible) with respect to $Q(A)$ if there exists a
recovery map $\mathcal{R} \in \mathrm{TPCP}(B, A)$ such that

\[ (\mathcal{R}\circ\mathcal{N})(\rho) = \rho \quad \text{for all } \rho \in Q(A) . \tag{2} \]

The data processing inequality (also known as monotonicity of the relative entropy) states that the
relative entropy between two states $\rho$ and $\sigma$, defined as $D(\rho\|\sigma) := \mathrm{tr}(\rho(\log\rho - \log\sigma))$ if
$\mathrm{supp}(\rho) \subseteq \mathrm{supp}(\sigma)$ and $+\infty$ otherwise, is non-increasing under physical evolutions [29, 44], i.e.,
$D(\rho\|\sigma) \geq D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma))$, where $\mathcal{N}$ is a quantum channel. This fundamental entropy inequality
has many applications in quantum information theory and is closely related to the sufficiency of $\mathcal{N}$.
As shown in [34, 35, 21, 22], for a given $\sigma \geq 0$ a quantum channel $\mathcal{N} \in \mathrm{TPCP}(A, B)$ is sufficient with
respect to $Q(A)$ if and only if $D(\rho\|\sigma) = D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma))$ for all $\rho \in Q(A)$. (We assume that the set
$Q(A)$ is such that $\mathrm{supp}(\rho) \subseteq \mathrm{supp}(\sigma)$ for all $\rho \in Q(A)$.) Furthermore, it is known that the data
processing inequality holds with equality if and only if there exists a recovery map
$\mathcal{R} \in \mathrm{TPCP}(B, A)$ that simultaneously reverses the action of the physical evolution $\mathcal{N}$ on both
states [34, 35, 36], i.e., $(\mathcal{R}\circ\mathcal{N})(\rho) = \rho$ and $(\mathcal{R}\circ\mathcal{N})(\sigma) = \sigma$. If the Hilbert spaces $A$ and $B$
are assumed to be separable and $\sigma \in \mathrm{TC}(A)$ is positive semi-definite, then the recovery map can be taken
as the Petz recovery map $\mathcal{P}_{\sigma,\mathcal{N}}$ (also known as the transpose map), defined as the unique solution to

\[ \langle a_2, \mathcal{N}^\dagger(a_1)\rangle_{\sigma} = \langle \mathcal{P}^\dagger_{\sigma,\mathcal{N}}(a_2), a_1\rangle_{\mathcal{N}(\sigma)} \quad \text{for all } a_1 \in L(B),\ a_2 \in L(A) , \tag{3} \]

where $L(B)$ denotes the set of bounded operators on $B$, $\mathrm{TC}(A)$ the set of trace-class operators on
$A$, $\langle a, b\rangle_\omega := \mathrm{tr}(a^\dagger \omega^{1/2} b\, \omega^{1/2})$ is a weighted inner product with $\omega$ taken to be positive
semi-definite and trace class [34, 35, 36], and $\mathcal{N}^\dagger$ denotes the adjoint map of $\mathcal{N}$. The Petz recovery map is
completely positive and trace non-increasing (see, e.g., [33]). If the Hilbert spaces $A$ and $B$ are
finite-dimensional, then, on the support of $\mathcal{N}(\sigma)$, the Petz recovery map takes the form

\[ \mathcal{P}_{\sigma,\mathcal{N}} : X_B \mapsto \sigma^{1/2}\, \mathcal{N}^\dagger\big(\mathcal{N}(\sigma)^{-1/2} X_B\, \mathcal{N}(\sigma)^{-1/2}\big)\, \sigma^{1/2} . \tag{4} \]

(Following the standard convention, $\sigma^{-1}$ is defined to be the inverse of $\sigma$ when $\sigma$ is considered as an
operator acting only on the support of $\sigma$.)
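For intuition, the commuting (classical) special case can be sketched numerically. In the sketch below, which is an illustration under assumptions and not part of the paper, states are probability vectors, the channel is a stochastic matrix, and (4) reduces to the Bayes-rule reverse channel. The toy channel `W` and the distributions `p`, `q` are hypothetical values chosen only for the example.

```python
import math

def relative_entropy(p, q):
    """D(p||q) in nats for probability vectors; +inf unless supp(p) lies inside supp(q)."""
    total = 0.0
    for pi, qi in zip(p, q):
        if pi > 0.0:
            if qi == 0.0:
                return math.inf
            total += pi * math.log(pi / qi)
    return total

def apply_channel(W, p):
    """Apply the stochastic matrix W[y][x] = Pr(y | x) to the distribution p."""
    return [sum(W[y][x] * p[x] for x in range(len(p))) for y in range(len(W))]

def bayes_recovery(W, q):
    """Classical counterpart of (4): R[x][y] = q[x] * W[y][x] / (Wq)[y]."""
    Wq = apply_channel(W, q)
    return [[q[x] * W[y][x] / Wq[y] if Wq[y] > 0.0 else 0.0
             for y in range(len(W))] for x in range(len(q))]

# Hypothetical toy data (not from the paper): a noisy binary channel and two inputs.
W = [[0.9, 0.2],
     [0.1, 0.8]]
p = [0.7, 0.3]
q = [0.4, 0.6]

d_in = relative_entropy(p, q)
d_out = relative_entropy(apply_channel(W, p), apply_channel(W, q))
recovered_q = apply_channel(bayes_recovery(W, q), apply_channel(W, q))

print(d_in >= d_out)   # data processing inequality for this instance
print(recovered_q)     # the reverse channel maps W(q) back to q, up to rounding
```

The second printed vector illustrates the property $(\mathcal{P}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\sigma) = \sigma$ in this diagonal setting.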

The concept of sufficient statistics can be made robust. A quantum channel $\mathcal{N} \in \mathrm{TPCP}(A, B)$ is
$\varepsilon$-sufficient with respect to $Q(A)$ if there exists a recovery map $\mathcal{R}_\varepsilon \in \mathrm{TPCP}(B, A)$ such that [20]

\[ \tfrac{1}{2}\,\|\rho - (\mathcal{R}_\varepsilon\circ\mathcal{N})(\rho)\|_1 \leq \varepsilon \quad \text{for all } \rho \in Q(A) , \tag{5} \]

for some $\varepsilon \in [0,1]$. Together with the case $\varepsilon = 0$ discussed above, this motivates the question of whether
there exists a refined version of the data processing inequality in terms of recoverability [47], which would
serve as an alternative characterization of approximate sufficient statistics.

An inequality that is closely related to the monotonicity of the relative entropy is the strong
subadditivity of quantum entropy [26, 27], which ensures that for any tripartite state
$\rho_{ABC} \in S(A \otimes B \otimes C)$ the conditional mutual information is non-negative, i.e.,
$I(A : C|B)_\rho := H(AB)_\rho + H(BC)_\rho - H(ABC)_\rho - H(B)_\rho \geq 0$, where $H(A)_\rho := -\mathrm{tr}(\rho_A \log \rho_A)$ denotes
the von Neumann entropy. This inequality has been strengthened recently with a remainder term in the form
of a distance to the closest recovered state. It was shown in [11] that for any density operator $\rho_{ABC}$
there exists a trace-preserving completely positive map (the recovery map) $\mathcal{R}_{B\to BC}$ such that

\[ I(A : C|B)_\rho \geq -2\log F\big(\rho_{ABC}, \mathcal{R}_{B\to BC}(\rho_{AB})\big) , \tag{6} \]

where the fidelity of $\rho$ and $\sigma$ is defined by $F(\rho,\sigma) := \|\sqrt{\rho}\sqrt{\sigma}\|_1$. If $A$, $B$, and $C$ are
finite-dimensional Hilbert spaces, then on the support of $\rho_B$, $\mathcal{R}_{B\to BC}$ can be taken as a rotated Petz
recovery map, i.e., a trace-preserving completely positive map of the form

\[ X_B \mapsto V_{BC}\, \rho_{BC}^{1/2}\big(\rho_B^{-1/2} U_B X_B U_B^\dagger \rho_B^{-1/2} \otimes \mathrm{id}_C\big)\rho_{BC}^{1/2}\, V_{BC}^\dagger , \tag{7} \]

where $V_{BC}$ and $U_B$ are unitaries on $B \otimes C$ and $B$, respectively.

The result of [11], whose proof is based on de Finetti type arguments and properties of Rényi
entropies, has been extended and generalized in various ways. In [7], based on the quantum state
redistribution protocol [9] and de Finetti type arguments, it was shown that the fidelity term can be
replaced by a measured relative entropy $D_M$, which is never smaller than the fidelity term, i.e.,

\[ I(A : C|B)_\rho \geq D_M\big(\rho_{ABC}\|\mathcal{R}_{B\to BC}(\rho_{AB})\big) \geq -2\log F\big(\rho_{ABC}, \mathcal{R}_{B\to BC}(\rho_{AB})\big) . \tag{8} \]

The measured relative entropy is defined as the supremum of the relative entropy with measured inputs
over all positive operator-valued measures (POVMs) $\mathcal{M} = \{M_x\}$, i.e.,

\[ D_M(\rho\|\sigma) := \sup\Big\{ D(\mathcal{M}(\rho)\|\mathcal{M}(\sigma)) : \mathcal{M}(\rho) = \sum_x \mathrm{tr}(\rho M_x)\,|x\rangle\langle x| \ \text{with}\ \sum_x M_x = \mathrm{id} \Big\} , \tag{9} \]

where $\{|x\rangle\}$ is a finite set of orthonormal vectors. We note that the tighter bound from [7] came at the
cost of losing all information about the structure of the recovery map. In [42], it was shown that there
exists a recovery map both satisfying (8) and possessing a universality property, in the sense that it
only depends on the marginal $\rho_{BC}$. Furthermore, for a linearized version of (6) it was shown that the
recovery map has the form of a rotated Petz recovery map with commuting unitaries, i.e., a recovery
map of the form in (7) where $V_{BC}$ and $U_B$ commute with $\rho_{BC}$ and $\rho_B$, respectively.
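The ordering between the measured relative entropy and the fidelity term in (8) can be illustrated for commuting states. The sketch below assumes diagonal (classical) states, for which measuring in the common eigenbasis attains the supremum in (9), so that $D_M$ coincides with the ordinary relative entropy; the probability vectors and the coarse-graining are hypothetical toy choices.

```python
import math

def relative_entropy(p, q):
    """D(p||q) in nats, assuming supp(p) lies inside supp(q)."""
    return sum(a * math.log(a / b) for a, b in zip(p, q) if a > 0.0)

def fidelity(p, q):
    """F(p, q) = ||sqrt(p) sqrt(q)||_1 for commuting states (Bhattacharyya coefficient)."""
    return sum(math.sqrt(a * b) for a, b in zip(p, q))

def coarse_grain(p, groups):
    """Outcome statistics of a measurement that merges the basis outcomes in each group."""
    return [sum(p[i] for i in group) for group in groups]

# Hypothetical toy distributions (assumptions for illustration only).
p = [0.5, 0.3, 0.2]
q = [0.2, 0.3, 0.5]

full = relative_entropy(p, q)                     # fine-grained measurement
merged = relative_entropy(coarse_grain(p, [(0, 2), (1,)]),
                          coarse_grain(q, [(0, 2), (1,)]))
fidelity_term = -2.0 * math.log(fidelity(p, q))

print(merged <= full)          # a coarser measurement cannot do better
print(full >= fidelity_term)   # second inequality in (8), classical case
```

The coarse-grained value shows why (9) is a supremum: discarding measurement resolution can only lose distinguishability.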

In view of approximate sufficiency of quantum channels discussed above, it would be helpful to
have a generalization of (6) in terms of relative entropies. This has been established in [45] with a
proof technique based on the notion of a Rényi generalization of a relative entropy difference [39] and
Hadamard's three-line theorem. It was shown that for any two states $\rho$ and $\sigma$ with $\mathrm{supp}(\rho) \subseteq \mathrm{supp}(\sigma)$
and any channel $\mathcal{N}$ there exists a recovery map $\mathcal{R}$ such that $(\mathcal{R}\circ\mathcal{N})(\sigma) = \sigma$ and

\[ D(\rho\|\sigma) - D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) \geq -2\log F\big(\rho, (\mathcal{R}\circ\mathcal{N})(\rho)\big) . \tag{10} \]

Furthermore, the recovery map was shown to be a rotated Petz recovery map with unitaries $U$ and $V$
in the algebra generated by $\sigma$ and $\mathcal{N}(\sigma)$, respectively. Very recently, a different proof technique
was found [43], based on the concavity and monotonicity of the operator logarithm, which shows that
there exists a recovery map $\mathcal{R}$ such that

\[ D(\rho\|\sigma) - D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) \geq D_M\big(\rho\|(\mathcal{R}\circ\mathcal{N})(\rho)\big) \tag{11} \]
\[ \geq -2\log F\big(\rho, (\mathcal{R}\circ\mathcal{N})(\rho)\big) . \tag{12} \]

The recovery map was shown to be a convex combination of rotated Petz recovery maps with unitaries
$U$ and $V$ in the algebra generated by $\sigma$ and $\mathcal{N}(\sigma)$, respectively, and therefore satisfies $(\mathcal{R}\circ\mathcal{N})(\sigma) = \sigma$.

Neither in [45] nor in [43] could the recovery map satisfying (10) and (11), respectively, be shown
to be universal, in the sense that it could be taken independent of $\rho$. In this article, we show that for
separable (not necessarily finite-dimensional) Hilbert spaces there exists an explicit, universal recovery
map $\mathcal{R}_{\sigma,\mathcal{N}}$ that fulfills (10) and depends only on $\sigma$ and $\mathcal{N}$. In addition, we show that there exists
a universal recovery map that satisfies (11) for finite-dimensional Hilbert spaces. We note that by
the Fuchs-van de Graaf inequality [13] the fidelity can be converted into a trace distance term such
that (10) and (12) provide alternative characterizations for approximate sufficient statistics.
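The conversion from a fidelity bound to a trace distance bound can be made concrete. The sketch below assumes the standard form of the Fuchs-van de Graaf inequality, $1 - F \leq \tfrac12\|\rho-\sigma\|_1 \leq \sqrt{1-F^2}$, and checks it for a pair of commuting toy states; the helper `trace_distance_bound` is a hypothetical name introduced here. If the entropy decrease in (10) is at most $\delta$, then $F \geq e^{-\delta/2}$ and hence the recovery error in (5) is at most $\sqrt{1 - e^{-\delta}}$.

```python
import math

def fidelity(p, q):
    """F(p, q) for commuting states given as probability vectors."""
    return sum(math.sqrt(a * b) for a, b in zip(p, q))

def trace_distance(p, q):
    """(1/2) * ||p - q||_1 for commuting states."""
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))

def trace_distance_bound(delta):
    """Upper bound on the recovery error implied by -2 log F <= delta (hypothetical helper)."""
    return math.sqrt(1.0 - math.exp(-delta))

# Hypothetical toy states (assumptions for illustration only).
p = [0.7, 0.3]
q = [0.4, 0.6]

F = fidelity(p, q)
T = trace_distance(p, q)
delta = -2.0 * math.log(F)

print(1.0 - F <= T <= math.sqrt(1.0 - F * F))   # Fuchs-van de Graaf for this pair
print(T <= trace_distance_bound(delta))          # same bound, expressed through delta
```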

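Result. We show that for any non-negative operator $\sigma$ and for any channel $\mathcal{N}$ there exists an explicit
and universal recovery map $\mathcal{R}_{\sigma,\mathcal{N}}$ such that

\[ D(\rho\|\sigma) - D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) \geq -2\log F\big(\rho, (\mathcal{R}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho)\big) \tag{13} \]

for all density operators $\rho$ such that $\mathrm{supp}(\rho) \subseteq \mathrm{supp}(\sigma)$. A consequence of the universality of the
recovery map $\mathcal{R}_{\sigma,\mathcal{N}}$ is that $(\mathcal{R}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\sigma) = \sigma$. We refer to Theorem 2.1 for a more precise statement.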
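Inequality (13) can be spot-checked in the commuting case. The sketch below rests on the assumption that all operators are diagonal in a fixed basis, in which case the rotations by imaginary powers act trivially and the recovery map reduces to the Bayes-rule reverse channel of a stochastic matrix. The randomly drawn channels and distributions are illustrative only, and a numerical check of this kind is of course not a proof.

```python
import math
import random

def relative_entropy(p, q):
    """D(p||q) in nats, assuming q has full support."""
    return sum(a * math.log(a / b) for a, b in zip(p, q) if a > 0.0)

def apply_channel(W, p):
    """Apply the stochastic matrix W[y][x] = Pr(y | x) to the distribution p."""
    return [sum(W[y][x] * p[x] for x in range(len(p))) for y in range(len(W))]

def bayes_recovery(W, q):
    """Classical form of the Petz map: R[x][y] = q[x] * W[y][x] / (Wq)[y]."""
    Wq = apply_channel(W, q)
    return [[q[x] * W[y][x] / Wq[y] for y in range(len(W))] for x in range(len(q))]

def fidelity(p, r):
    return sum(math.sqrt(a * b) for a, b in zip(p, r))

def random_distribution(n, rng):
    """Strictly positive random probability vector (full support by construction)."""
    weights = [rng.random() + 1e-3 for _ in range(n)]
    total = sum(weights)
    return [w / total for w in weights]

def slack(W, p, q):
    """Left-hand side of (13) minus its right-hand side; should be non-negative."""
    gap = relative_entropy(p, q) - relative_entropy(apply_channel(W, p), apply_channel(W, q))
    recovered = apply_channel(bayes_recovery(W, q), apply_channel(W, p))
    return gap + 2.0 * math.log(fidelity(p, recovered))

rng = random.Random(0)
worst_slack = math.inf
for _ in range(1000):
    n_in, n_out = 4, 3
    columns = [random_distribution(n_out, rng) for _ in range(n_in)]
    W = [[columns[x][y] for x in range(n_in)] for y in range(n_out)]
    worst_slack = min(worst_slack, slack(W, random_distribution(n_in, rng),
                                         random_distribution(n_in, rng)))

print(worst_slack >= -1e-9)   # no violation found among the sampled instances
```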

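Rotated and twirled Petz recovery map. We next introduce two classes of recovery maps that
take the form of a rotated (or twirled) Petz recovery map. These are the building blocks for an explicit
recovery map that satisfies (13). In [45], for $t \in \mathbb{R}$ the following rotated version of the Petz recovery
map (given in (4)) has been introduced

\[ \mathcal{R}^t_{\sigma,\mathcal{N}} : X_B \mapsto \sigma^{-\mathrm{i}t}\, \mathcal{P}_{\sigma,\mathcal{N}}\big(\mathcal{N}(\sigma)^{\mathrm{i}t} X_B\, \mathcal{N}(\sigma)^{-\mathrm{i}t}\big)\, \sigma^{\mathrm{i}t} . \tag{14} \]

A less specific form of a rotated Petz recovery map is given by

\[ X_B \mapsto V\, \mathcal{P}_{\sigma,\mathcal{N}}\big(U X_B U^\dagger\big) V^\dagger , \tag{15} \]

where $U$ and $V$ denote unitaries that commute with $\mathcal{N}(\sigma)$ and $\sigma$, respectively. Let $\mathrm{U}_{A,\sigma}$ be the set of
unitaries on $A$ that commute with $\sigma$. Let $P(A)$ denote the set of non-negative operators on a Hilbert
space $A$. (If $A$ is not finite-dimensional, we take the convention that operators in $P(A)$ are trace-class.)
For any $\sigma \in P(A)$ and $\mathcal{N} \in \mathrm{TPCP}(A, B)$ we denote the convex hull of rotated Petz recovery maps by

\[ \mathrm{R}_{\sigma,\mathcal{N}} := \mathrm{conv}\big\{ X_B \mapsto V\, \mathcal{P}_{\sigma,\mathcal{N}}(U X_B U^\dagger) V^\dagger : U \in \mathrm{U}_{B,\mathcal{N}(\sigma)} \text{ and } V \in \mathrm{U}_{A,\sigma} \big\} . \tag{16} \]

We note that the rotated Petz recovery map is trace non-increasing and completely positive. For
any $\sigma \in P(A)$, we define

\[ S_\sigma(A) := \{\rho \in S(A) : \mathrm{supp}(\rho) \subseteq \mathrm{supp}(\sigma)\} . \tag{17} \]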


2 Main results 

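Theorem 2.1. Let $A$ and $B$ be separable Hilbert spaces. For any $\sigma \in P(A)$, any $\rho \in S_\sigma(A)$ and any
$\mathcal{N} \in \mathrm{TPCP}(A, B)$ we have

\[ D(\rho\|\sigma) - D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) \geq -2\log F\big(\rho, (\mathcal{R}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho)\big) , \tag{18} \]

where

\[ \mathcal{R}_{\sigma,\mathcal{N}}(\cdot) := \int_{\mathbb{R}} \mathrm{d}t\, \beta_0(t)\, \mathcal{R}^{t/2}_{\sigma,\mathcal{N}}(\cdot) \tag{19} \]

on the support of $\mathcal{N}(\sigma)$ with $\mathcal{R}^{t/2}_{\sigma,\mathcal{N}} : X_B \mapsto \sigma^{-\mathrm{i}t/2}\, \mathcal{P}_{\sigma,\mathcal{N}}\big(\mathcal{N}(\sigma)^{\mathrm{i}t/2} X_B\, \mathcal{N}(\sigma)^{-\mathrm{i}t/2}\big)\, \sigma^{\mathrm{i}t/2}$ and a probability
density function $\beta_0$ on $\mathbb{R}$ defined by

\[ \beta_0(t) := \frac{\pi}{2}\big(\cosh(\pi t) + 1\big)^{-1} . \tag{20} \]

If $A$ and $B$ are separable, $\mathcal{P}_{\sigma,\mathcal{N}}$ is defined as the adjoint of the unique linear map satisfying (3)
with domain $\mathrm{supp}(\mathcal{N}(\sigma))$ and range $\mathrm{supp}(\sigma)$. If $A$ and $B$ are finite-dimensional, this unique linear map
$\mathcal{P}_{\sigma,\mathcal{N}}$ is given by (4).

Figure 1 depicts the probability density $\beta_0$ as a function of $t \in \mathbb{R}$. We note that the recovery map
$\mathcal{R}_{\sigma,\mathcal{N}}$ that satisfies (18) can be chosen such that it projects everything outside of the support of $\mathcal{N}(\sigma)$
to zero. For the case of finite-dimensional Hilbert spaces we can tighten the inequality (18) at the cost
of losing the explicit form of the recovery map.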
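That $\beta_0$ in (20) is a normalized density can be checked numerically. The sketch below uses a plain trapezoidal rule on a truncated interval (the cutoff and step count are arbitrary choices made for this illustration) and compares against the antiderivative $\tfrac12\big(1 + \tanh(\pi t/2)\big)$, which follows from $\cosh(\pi t) + 1 = 2\cosh^2(\pi t/2)$.

```python
import math

def beta0(t):
    """Probability density from (20)."""
    return 0.5 * math.pi / (math.cosh(math.pi * t) + 1.0)

def beta0_cdf(t):
    """Antiderivative of beta0 normalized to run from 0 to 1."""
    return 0.5 * (1.0 + math.tanh(0.5 * math.pi * t))

def trapezoid(f, a, b, n):
    """Composite trapezoidal rule with n subintervals on [a, b]."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for k in range(1, n):
        total += f(a + k * h)
    return total * h

total_mass = trapezoid(beta0, -20.0, 20.0, 40000)
mass_near_zero = trapezoid(beta0, -1.0, 1.0, 4000)

print(total_mass)                                          # close to 1
print(mass_near_zero, beta0_cdf(1.0) - beta0_cdf(-1.0))    # weight carried by |t| <= 1
```

Most of the weight sits near $t = 0$, so the averaged map in (19) is dominated by rotations close to the unrotated Petz map.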

Theorem 2.2. Let $A$ and $B$ be finite-dimensional Hilbert spaces. For any $\sigma \in P(A)$ and any
$\mathcal{N} \in \mathrm{TPCP}(A, B)$ there exists a recovery map $\widetilde{\mathcal{R}}_{\sigma,\mathcal{N}}$ inside the set $\mathrm{R}_{\sigma,\mathcal{N}}$ defined in (16), such that
for all $\rho \in S_\sigma(A)$

\[ D(\rho\|\sigma) - D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) \geq D_M\big(\rho\|(\widetilde{\mathcal{R}}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho)\big) \tag{21} \]
\[ \geq -2\log F\big(\rho, (\widetilde{\mathcal{R}}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho)\big) . \tag{22} \]

We note that the measured relative entropy is a quantity that satisfies several desirable properties
(see, e.g., [42, Lemma B.3]). Therefore the bound in (21) can be particularly helpful when it is used
as a proof tool (see, e.g., Corollaries 5.2 and 5.3). Furthermore, as discussed in [7], the right-hand side
of (21) can be substantially larger than (22) in some cases.

Remark 2.3 (Functoriality properties). The recovery maps $\mathcal{R}_{\sigma,\mathcal{N}}$ and $\widetilde{\mathcal{R}}_{\sigma,\mathcal{N}}$ stated in Theorems 2.1
and 2.2, respectively, satisfy apart from (18) several desirable "functoriality" properties. Some of them
have been stated in [47, 25, 45]. Since both recovery maps satisfy these properties, we abbreviate them
both as $\mathcal{R}_{\sigma,\mathcal{N}}$ in what immediately follows.

1. Universality. The recovery map does not depend on $\rho$. This follows directly from Theorem 2.1.

2. Perfect reconstruction of $\sigma$ from $\mathcal{N}(\sigma)$. The recovery map satisfies $(\mathcal{R}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\sigma) = \sigma$. This
is clear from the fact that any rotated Petz map of the form in (14) perfectly recovers $\sigma$ [45], and
thus so does any convex combination of these maps. Alternatively, as the recovery map predicted
by Theorem 2.1 that satisfies (18) is universal, the assertion follows by choosing $\rho = \sigma/\mathrm{tr}(\sigma)$.

3. Normalization. In case $\mathcal{N} = \mathcal{I}$, where $\mathcal{I}$ denotes the identity map, we have $\mathcal{R}_{\sigma,\mathcal{N}}(\cdot) = \Pi_\sigma(\cdot)\Pi_\sigma$.
Thus if $\sigma$ has full support the recovery map is equal to the identity channel. This follows directly
by [45, Section 4.1] and by definition of the recovery map $\mathcal{R}_{\sigma,\mathcal{N}}$.

4. Stabilization. For any $\sigma \in P(A)$, any $\mathcal{N} \in \mathrm{TPCP}(A, B)$, any reference system $R$, and any
full-rank $\tau \in P(R)$, we have $\mathcal{R}_{\sigma\otimes\tau_R,\,\mathcal{N}\otimes\mathcal{I}_R}(\cdot) = (\mathcal{R}_{\sigma,\mathcal{N}} \otimes \mathcal{I}_R)(\cdot)$. This follows by combining [45,
Section 4.2] together with the normalization property discussed above.

We note that by following [45, Sections 4.2 and 4.3] together with the strengthening of Theorem 2.1
given in (23) we can obtain remainder terms that fulfill some parallel and serial composition rules.

The proof of Theorem 2.1 consists of two parts. We first prove the statement for finite-dimensional
Hilbert spaces $A$ and $B$ by employing a strengthened version of Hadamard's three-line theorem that
is due to Hirschman [18]. By an approximation argument we show that the result remains valid for
separable Hilbert spaces. Theorem 2.2 is proven differently. Via Sion's minimax theorem [41,
Corollary 3.3], we show the existence of a universal recovery map that satisfies (21) for any finite set of
states $\rho \in S(A)$ (see Step 1). We then show, by using the concept of an $\varepsilon$-net, how this implies the
existence of a universal recovery map that satisfies (21) for all $\rho \in S(A)$.



Figure 1: This plot depicts the probability density $\beta_0$ defined in (20) as a function of $t \in \mathbb{R}$. We see
that it is peaked around $t = 0$, which corresponds to the Petz recovery map, i.e., $\mathcal{R}^0_{\sigma,\mathcal{N}} = \mathcal{P}_{\sigma,\mathcal{N}}$.


3 Proof of Theorem 2.1 

Step 1: Proof for finite-dimensional Hilbert spaces 

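In this step we assume that the Hilbert spaces $A$ and $B$ are finite-dimensional. We note that we will
prove a slightly stronger version of Theorem 2.1 that by concavity of the logarithm and the fidelity
immediately implies Theorem 2.1.

Stronger version of Theorem 2.1. For any $\sigma \in P(A)$, any $\rho \in S_\sigma(A)$ and any $\mathcal{N} \in \mathrm{TPCP}(A, B)$
we have

\[ D(\rho\|\sigma) - D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) \geq -2\int_{\mathbb{R}} \mathrm{d}t\, \beta_0(t)\, \log F\big(\rho, (\mathcal{R}^{t/2}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho)\big) , \tag{23} \]

where $\mathcal{R}^t_{\sigma,\mathcal{N}}(\cdot)$ is defined in (14) and the probability density function $\beta_0(t) := \frac{\pi}{2}(\cosh(\pi t) + 1)^{-1}$.

Remark 3.1. Inequality (23) together with the fact that the mapping $t \mapsto \mathcal{R}^t_{\sigma,\mathcal{N}}$ is continuous implies
that for any $\sigma \in P(A)$, $\rho \in S_\sigma(A)$ and $\mathcal{N} \in \mathrm{TPCP}(A, B)$ such that $D(\rho\|\sigma) = D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma))$ we have
$(\mathcal{R}^t_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho) = \rho$ and $(\mathcal{R}^t_{\sigma,\mathcal{N}}\circ\mathcal{N})(\sigma) = \sigma$ for all $t \in \mathbb{R}$ with $\mathcal{R}^t_{\sigma,\mathcal{N}}(\cdot)$ defined in (14). This follows
because $F(\omega,\tau) \in [0,1]$ and $F(\omega,\tau) = 1$ if and only if $\omega = \tau$ for density operators $\omega$ and $\tau$.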

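Our proof of (23) is similar to the approach taken in [45]. There are two main ingredients: a Rényi
generalization of a relative entropy difference [39] and Hirschman's improvement of the Hadamard
three-line theorem [18]. We begin by recalling these two ingredients and then proceed to a proof of (23).

Let $L(A)$ denote the space of bounded linear operators acting on a Hilbert space $A$. For any $L \in L(A)$
the Schatten $p$-norm is defined as

\[ \|L\|_p := \big(\mathrm{tr}(|L|^p)\big)^{1/p} \quad \text{for } p \in [1,\infty) , \tag{24} \]

where $|L| := \sqrt{L^\dagger L}$. A Rényi generalization of a relative entropy difference is defined as [39]

\[ \Delta_\alpha(\rho,\sigma,\mathcal{N}) := \frac{2\alpha}{\alpha-1} \log \Big\| \big([\mathcal{N}(\rho)]^{\frac{1-\alpha}{2\alpha}} [\mathcal{N}(\sigma)]^{\frac{\alpha-1}{2\alpha}} \otimes \mathrm{id}_E\big)\, U_{A\to BE}\, \sigma^{\frac{1-\alpha}{2\alpha}} \rho^{\frac12} \Big\|_{2\alpha} , \tag{25} \]

where $\alpha \in (0,1) \cup (1,\infty)$, and $U_{A\to BE}$ is an isometric extension of the channel $\mathcal{N}$. That is, $U_{A\to BE}$ is
a linear isometry satisfying $\mathrm{tr}_E\big(U_{A\to BE}(\cdot)U^\dagger_{A\to BE}\big) = \mathcal{N}(\cdot)$ and $U^\dagger_{A\to BE} U_{A\to BE} = \mathrm{id}_A$. All isometric
extensions of a channel are related by an isometry acting on the environment system $E$, so that the
definition in (25) is invariant under any such choice. Recall also that the adjoint $\mathcal{N}^\dagger$ of a channel is
given in terms of an isometric extension $U$ as $\mathcal{N}^\dagger(\cdot) = U^\dagger\big((\cdot) \otimes \mathrm{id}_E\big)U$.

Lemma 3.2 ([39, 45]). The following limit holds for $\rho$, $\sigma$, and $\mathcal{N}$ as given in the statement of
Theorem 2.1:

\[ \lim_{\alpha\to 1} \Delta_\alpha(\rho,\sigma,\mathcal{N}) = D(\rho\|\sigma) - D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) . \tag{26} \]

For $\alpha = \frac12$, observe that

\[ \Delta_{\frac12}(\rho,\sigma,\mathcal{N}) = -2\log \Big\| \big([\mathcal{N}(\rho)]^{\frac12} [\mathcal{N}(\sigma)]^{-\frac12} \otimes \mathrm{id}_E\big)\, U_{A\to BE}\, \sigma^{\frac12} \rho^{\frac12} \Big\|_1 = -2\log F\big(\rho, \mathcal{P}_{\sigma,\mathcal{N}}(\mathcal{N}(\rho))\big) . \tag{27} \]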


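The following lemma is based on Hirschman's improvement of the Hadamard three-line theorem [18],
and for completeness, we provide a proof in Appendix A.

Lemma 3.3. Let $S := \{z \in \mathbb{C} : 0 \leq \mathrm{Re}\{z\} \leq 1\}$ and let $G : S \to L(\mathcal{H})$ be a bounded map that is
holomorphic on the interior of $S$ and continuous on the boundary. Let $\theta \in (0,1)$ and define $p_\theta$ by

\[ \frac{1}{p_\theta} = \frac{1-\theta}{p_0} + \frac{\theta}{p_1} , \tag{28} \]

where $p_0, p_1 \in [1,\infty]$. Then the following bound holds

\[ \log\big(\|G(\theta)\|_{p_\theta}\big) \leq \int_{\mathbb{R}} \mathrm{d}t\, \Big( \alpha_\theta(t) \log\big(\|G(\mathrm{i}t)\|_{p_0}^{1-\theta}\big) + \beta_\theta(t) \log\big(\|G(1+\mathrm{i}t)\|_{p_1}^{\theta}\big) \Big) , \tag{29} \]

where $\alpha_\theta(t)$ and $\beta_\theta(t)$ are defined by

\[ \alpha_\theta(t) := \frac{\sin(\pi\theta)}{2(1-\theta)\big(\cosh(\pi t) - \cos(\pi\theta)\big)} \quad \text{and} \quad \beta_\theta(t) := \frac{\sin(\pi\theta)}{2\theta\big(\cosh(\pi t) + \cos(\pi\theta)\big)} . \tag{30} \]

Remark 3.4. Fix $\theta \in (0,1)$. Observe that $\alpha_\theta(t), \beta_\theta(t) \geq 0$ for all $t \in \mathbb{R}$ and we have

\[ \int_{\mathbb{R}} \mathrm{d}t\, \alpha_\theta(t) = \int_{\mathbb{R}} \mathrm{d}t\, \beta_\theta(t) = 1 \tag{31} \]

(see, e.g., [16, Exercise 1.3.8]) so that $\alpha_\theta(t)$ and $\beta_\theta(t)$ can be interpreted as probability density functions.
Furthermore, the following limit holds

\[ \lim_{\theta \searrow 0} \beta_\theta(t) = \frac{\pi}{2\big(\cosh(\pi t) + 1\big)} = \beta_0(t) , \tag{32} \]

where $\beta_0$ is also a probability density function on $\mathbb{R}$.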
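The normalization in (31) and the limit in (32) can be checked numerically. The sketch below integrates the densities from (30) with a plain trapezoidal rule on a truncated interval; the cutoff, the step count, and the sampled values of $\theta$ and $t$ are arbitrary choices made for this illustration.

```python
import math

def alpha(theta, t):
    """Density alpha_theta from (30)."""
    return math.sin(math.pi * theta) / (
        2.0 * (1.0 - theta) * (math.cosh(math.pi * t) - math.cos(math.pi * theta)))

def beta(theta, t):
    """Density beta_theta from (30)."""
    return math.sin(math.pi * theta) / (
        2.0 * theta * (math.cosh(math.pi * t) + math.cos(math.pi * theta)))

def beta0(t):
    """Limiting density from (32)."""
    return 0.5 * math.pi / (math.cosh(math.pi * t) + 1.0)

def trapezoid(f, a, b, n):
    """Composite trapezoidal rule with n subintervals on [a, b]."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for k in range(1, n):
        total += f(a + k * h)
    return total * h

masses = {
    theta: (trapezoid(lambda t: alpha(theta, t), -20.0, 20.0, 80000),
            trapezoid(lambda t: beta(theta, t), -20.0, 20.0, 80000))
    for theta in (0.25, 0.5, 0.75)
}
limit_gap = max(abs(beta(1e-6, t) - beta0(t)) for t in (-1.0, 0.0, 0.3, 2.0))

print(masses)      # each entry is close to (1, 1)
print(limit_gap)   # small: beta_theta approaches beta_0 as theta tends to 0
```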


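We can now readily establish the desired result in (23). In what follows, we abbreviate the isometric
extension $U_{A\to BE}$ of the channel $\mathcal{N}$ as $U$. Pick

\[ G(z) := \big([\mathcal{N}(\rho)]^{\frac{z}{2}} [\mathcal{N}(\sigma)]^{-\frac{z}{2}} \otimes \mathrm{id}_E\big)\, U \sigma^{\frac{z}{2}} \rho^{\frac12} , \tag{33} \]

$p_0 = 2$, $p_1 = 1$, and $\theta \in (0,1)$, which fixes $p_\theta = \frac{2}{1+\theta}$. The operator-valued function $G(z)$ satisfies the
conditions needed to apply Lemma 3.3. For the choices above, we find that

\[ \|G(\theta)\|_{\frac{2}{1+\theta}} = \Big\| \big([\mathcal{N}(\rho)]^{\frac{\theta}{2}} [\mathcal{N}(\sigma)]^{-\frac{\theta}{2}} \otimes \mathrm{id}_E\big)\, U \sigma^{\frac{\theta}{2}} \rho^{\frac12} \Big\|_{\frac{2}{1+\theta}} , \tag{34} \]

and

\[ \|G(\mathrm{i}t)\|_2 = \Big\| \big([\mathcal{N}(\rho)]^{\frac{\mathrm{i}t}{2}} [\mathcal{N}(\sigma)]^{-\frac{\mathrm{i}t}{2}} \otimes \mathrm{id}_E\big)\, U \sigma^{\frac{\mathrm{i}t}{2}} \rho^{\frac12} \Big\|_2 \leq \big\|\rho^{\frac12}\big\|_2 = 1 , \tag{35} \]

as well as

\[ \|G(1+\mathrm{i}t)\|_1 = \Big\| \big([\mathcal{N}(\rho)]^{\frac{1+\mathrm{i}t}{2}} [\mathcal{N}(\sigma)]^{-\frac{1+\mathrm{i}t}{2}} \otimes \mathrm{id}_E\big)\, U \sigma^{\frac{1+\mathrm{i}t}{2}} \rho^{\frac12} \Big\|_1 \]
\[ = \Big\| \big([\mathcal{N}(\rho)]^{\frac{\mathrm{i}t}{2}} [\mathcal{N}(\rho)]^{\frac12} [\mathcal{N}(\sigma)]^{-\frac{\mathrm{i}t}{2}} [\mathcal{N}(\sigma)]^{-\frac12} \otimes \mathrm{id}_E\big)\, U \sigma^{\frac12} \sigma^{\frac{\mathrm{i}t}{2}} \rho^{\frac12} \Big\|_1 \]
\[ = \Big\| \big([\mathcal{N}(\rho)]^{\frac12} [\mathcal{N}(\sigma)]^{-\frac{\mathrm{i}t}{2}} [\mathcal{N}(\sigma)]^{-\frac12} \otimes \mathrm{id}_E\big)\, U \sigma^{\frac12} \sigma^{\frac{\mathrm{i}t}{2}} \rho^{\frac12} \Big\|_1 \]
\[ = F\big(\rho, (\mathcal{R}^{t/2}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho)\big) . \tag{36} \]

Then we can apply the fact that $\|G(\mathrm{i}t)\|_2 \leq 1$ and (29) to conclude that the following bound holds for
all $\theta \in (0,1)$:

\[ \log \Big\| \big([\mathcal{N}(\rho)]^{\frac{\theta}{2}} [\mathcal{N}(\sigma)]^{-\frac{\theta}{2}} \otimes \mathrm{id}_E\big)\, U \sigma^{\frac{\theta}{2}} \rho^{\frac12} \Big\|_{\frac{2}{1+\theta}} \leq \int_{\mathbb{R}} \mathrm{d}t\, \beta_\theta(t) \log\Big( F\big(\rho, (\mathcal{R}^{t/2}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho)\big)^{\theta} \Big) , \tag{37} \]

which implies

\[ -\frac{2}{\theta} \log \Big\| \big([\mathcal{N}(\rho)]^{\frac{\theta}{2}} [\mathcal{N}(\sigma)]^{-\frac{\theta}{2}} \otimes \mathrm{id}_E\big)\, U \sigma^{\frac{\theta}{2}} \rho^{\frac12} \Big\|_{\frac{2}{1+\theta}} \geq -2 \int_{\mathbb{R}} \mathrm{d}t\, \beta_\theta(t) \log F\big(\rho, (\mathcal{R}^{t/2}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho)\big) . \tag{38} \]

Letting $\theta = \frac{1-\alpha}{\alpha}$, we see that this is the same as

\[ \Delta_\alpha(\rho,\sigma,\mathcal{N}) \geq -2 \int_{\mathbb{R}} \mathrm{d}t\, \beta_{\frac{1-\alpha}{\alpha}}(t) \log F\big(\rho, (\mathcal{R}^{t/2}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho)\big) . \tag{39} \]

Since the inequality in (38) holds for all $\theta \in (0,1)$ and thus (39) holds for all $\alpha \in (\frac12, 1)$, we can take
the limit as $\alpha \nearrow 1$ and apply (26), (32), and the dominated convergence theorem to conclude that (23)
holds.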
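The parameter bookkeeping behind the substitution $\theta = \frac{1-\alpha}{\alpha}$ can be spelled out as follows. This is only an unpacking of the choices $p_0 = 2$, $p_1 = 1$ made above and of definitions (25) and (28), added here as a reading aid.

```latex
\begin{align*}
\frac{1}{p_\theta} &= \frac{1-\theta}{2} + \frac{\theta}{1} = \frac{1+\theta}{2}
  \quad\Longrightarrow\quad p_\theta = \frac{2}{1+\theta}, \\
\theta = \frac{1-\alpha}{\alpha}
  &\quad\Longrightarrow\quad 1+\theta = \frac{1}{\alpha}, \qquad
  p_\theta = 2\alpha, \qquad
  \frac{\theta}{2} = \frac{1-\alpha}{2\alpha}, \qquad
  -\frac{2}{\theta} = \frac{2\alpha}{\alpha-1}, \\
\theta \in (0,1) &\quad\Longleftrightarrow\quad \alpha \in \left(\tfrac12, 1\right), \qquad
  \theta \searrow 0 \quad\Longleftrightarrow\quad \alpha \nearrow 1 .
\end{align*}
```

With these identities, the norm index, the exponents, and the prefactor on the left-hand side of (38) match those in the definition (25) of $\Delta_\alpha$ term by term.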


Remark 3.5. If $A$ and $B$ are finite-dimensional Hilbert spaces the statement in Theorem 2.1 can be
slightly generalized. For any $\sigma \geq 0$ and any trace non-increasing completely positive map $\mathcal{N}$ with Kraus
operators $\{N_x\}$ such that $0 \leq \sum_x N_x^\dagger N_x \leq \mathrm{id}$, the recovery map $\mathcal{R}_{\sigma,\mathcal{N}}$ defined in Theorem 2.1 satisfies
(18) for all subnormalized density operators $\rho \geq 0$ with $0 < \mathrm{tr}(\rho) \leq 1$ such that $\mathrm{supp}(\rho) \subseteq \mathrm{supp}(\sigma)$.
This follows by the same argument given in Step 1 by using $U_{A\to BE} = \sum_x N_x \otimes |x\rangle_E$ as the isometric
extension of the trace non-increasing and completely positive map $\mathcal{N}$.

Step 2: Extension to infinite dimensions

In this step the Hilbert spaces $A$ and $B$ are assumed to be separable (not necessarily finite-dimensional).
We will show how to lift the stronger version of Theorem 2.1 for finite-dimensional Hilbert spaces, such
that it can apply to states $\rho$ and $\sigma$ and a channel $\mathcal{N}$ associated with separable Hilbert spaces. This is
accomplished via a limiting argument. Let $\{\Pi^a_A\}_{a\in\mathbb{N}}$ and $\{\Pi^b_B\}_{b\in\mathbb{N}}$ be sequences of finite-rank projectors
on $A$ and $B$, respectively, that converge to $\mathrm{id}_A$ and $\mathrm{id}_B$, respectively, with respect to the weak operator
topology, meaning that

\[ \lim_{a\to\infty} \langle\psi|\Pi^a_A|\phi\rangle = \langle\psi|\phi\rangle \tag{40} \]

for all vectors $|\phi\rangle, |\psi\rangle \in A$ (similarly for $\Pi^b_B \to \mathrm{id}_B$). For $\sigma \in P(A)$ and $\rho \in S_\sigma(A)$ we consider projected
versions

\[ \sigma^a := \Pi^a_A\, \sigma\, \Pi^a_A \quad \text{and} \quad \rho^a := \Pi^a_A\, \rho\, \Pi^a_A . \tag{41} \]

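We note that the sequences $\{\rho^a\}_{a\in\mathbb{N}}$ and $\{\sigma^a\}_{a\in\mathbb{N}}$ converge to $\rho$ and $\sigma$, respectively, in the trace norm
(see, e.g., Corollary 2 of [15] or Lemma 11.1 of [19]). Let $S^a$ be the set of non-negative operators that
is generated by (41) for all $\rho \in S$. For any $\mathcal{N} \in \mathrm{TPCP}(A, B)$ we define its analogue with a projection
at the input and output as

\[ \mathcal{N}^{a,b} : X_A \mapsto \Pi^b_B\, \mathcal{N}\big(\Pi^a_A X_A \Pi^a_A\big)\, \Pi^b_B . \tag{42} \]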
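The trace-norm convergence of the projected operators in (41) can be illustrated with a diagonal toy example. The sketch below assumes a state with geometric eigenvalues $(1-r)r^k$ (a hypothetical choice, cut off at a large index so that it fits in a list) and projects onto the first $a$ basis vectors; the trace-norm distance is then the discarded tail mass $r^a$. It illustrates the cited convergence statement and does not replace the references.

```python
# Hypothetical diagonal state with geometric eigenvalues (1 - r) * r**k, k = 0, 1, 2, ...
r = 0.5
cutoff = 200   # numerical stand-in for the infinite-dimensional space
sigma = [(1.0 - r) * r ** k for k in range(cutoff)]

def project(state, a):
    """Diagonal of Pi^a * state * Pi^a, with Pi^a projecting onto the first a basis vectors."""
    return [x if k < a else 0.0 for k, x in enumerate(state)]

def trace_norm_distance(u, v):
    """||u - v||_1 for diagonal operators."""
    return sum(abs(x - y) for x, y in zip(u, v))

ranks = (1, 2, 4, 8, 16)
distances = [trace_norm_distance(project(sigma, a), sigma) for a in ranks]

for a, d in zip(ranks, distances):
    print(a, d)   # the distance shrinks like r**a as the projector rank grows
```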

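Note that $\mathcal{N}^{a,b}$ converges to $\mathcal{N}$ in the weak operator topology, in the sense that

\[ \lim_{a,b\to\infty} \langle\phi'|\,\mathcal{N}^{a,b}(|\phi\rangle\langle\psi|)\,|\psi'\rangle = \langle\phi'|\,\mathcal{N}(|\phi\rangle\langle\psi|)\,|\psi'\rangle , \tag{43} \]

for all vectors $|\phi\rangle, |\psi\rangle \in A$ and $|\phi'\rangle, |\psi'\rangle \in B$ (cf. the discussions in [40]).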

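We start by proving two lemmas that show how the difference of relative entropies and the fidelity,
respectively, change when considering projected states.

Lemma 3.6. For any $\sigma \in P(A)$, any $\rho \in S_\sigma(A)$, and any $\mathcal{N} \in \mathrm{TPCP}(A, B)$, we have

\[ \limsup_{a\to\infty}\, \limsup_{b\to\infty}\, D(\rho^a\|\sigma^a) - D\big(\mathcal{N}^{a,b}(\rho^a)\|\mathcal{N}^{a,b}(\sigma^a)\big) \leq D(\rho\|\sigma) - D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) . \tag{44} \]

Proof. By Lemma 3 in [28] and since the relative entropy is lower semicontinuous [19, Exercise 7.22],
we obtain

\[ D(\rho^a\|\sigma^a) \leq D(\rho\|\sigma) \leq \liminf_{a\to\infty} D(\rho^a\|\sigma^a) , \tag{45} \]

which implies that

\[ \lim_{a\to\infty} D(\rho^a\|\sigma^a) = D(\rho\|\sigma) . \tag{46} \]

The lower semicontinuity of the relative entropy implies that

\[ \liminf_{a\to\infty}\, \liminf_{b\to\infty}\, D\big(\mathcal{N}^{a,b}(\rho^a)\|\mathcal{N}^{a,b}(\sigma^a)\big) \geq \liminf_{a\to\infty} D\big(\mathcal{N}(\rho^a)\|\mathcal{N}(\sigma^a)\big) \tag{47} \]
\[ \geq D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) . \tag{48} \]

This proves the assertion. □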


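Lemma 3.7. The Petz recovery map $\mathcal{P}_{\sigma^a,\mathcal{N}^{a,b}}$ defined in (4) satisfies $\lim_{a\to\infty}\lim_{b\to\infty} \mathcal{P}_{\sigma^a,\mathcal{N}^{a,b}} = \mathcal{P}_{\sigma,\mathcal{N}}$
(in the sense of (43)), where $\mathcal{P}_{\sigma,\mathcal{N}}$ is a recovery map that is defined as the (unique) linear map with
domain $\mathrm{supp}(\mathcal{N}(\sigma))$ and range $\mathrm{supp}(\sigma)$ satisfying

\[ \langle a_2, \mathcal{N}^\dagger(a_1)\rangle_\sigma = \langle \mathcal{P}^\dagger_{\sigma,\mathcal{N}}(a_2), a_1\rangle_{\mathcal{N}(\sigma)} \quad \text{for all } a_1 \in L(B),\ a_2 \in L(A) , \tag{49} \]

with the weighted inner product $\langle a, b\rangle_\omega := \mathrm{tr}(a^\dagger \omega^{1/2} b\, \omega^{1/2})$ and $\omega$ positive semi-definite and trace class.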


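Proof. We begin by outlining the proof. As shown by Petz [34, 35, 36] (see also [33, Chapter 8]), the
map $\mathcal{P}_{\sigma^a,\mathcal{N}^{a,b}}$ defined in (4) is the unique linear map with domain $\mathrm{supp}(\mathcal{N}^{a,b}(\sigma^a))$ and range $\mathrm{supp}(\sigma^a)$
satisfying

\[ \langle a_2, (\mathcal{N}^{a,b})^\dagger(a_1)\rangle_{\sigma^a} = \langle \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2), a_1\rangle_{\mathcal{N}^{a,b}(\sigma^a)} \quad \text{for all } a_1 \in L(B),\ a_2 \in L(A) , \tag{50} \]

and the map $\mathcal{P}_{\sigma,\mathcal{N}}$ is the unique linear map with domain $\mathrm{supp}(\mathcal{N}(\sigma))$ and range $\mathrm{supp}(\sigma)$ satisfying

\[ \langle a_2, \mathcal{N}^\dagger(a_1)\rangle_{\sigma} = \langle \mathcal{P}^\dagger_{\sigma,\mathcal{N}}(a_2), a_1\rangle_{\mathcal{N}(\sigma)} \quad \text{for all } a_1 \in L(B),\ a_2 \in L(A) . \tag{51} \]

We will first show that

\[ \lim_{a\to\infty}\lim_{b\to\infty} \langle a_2, (\mathcal{N}^{a,b})^\dagger(a_1)\rangle_{\sigma^a} = \langle a_2, \mathcal{N}^\dagger(a_1)\rangle_{\sigma} \tag{52} \]

for all $a_1 \in L(B)$, $a_2 \in L(A)$, which by (50) and (51) implies that

\[ \lim_{a\to\infty}\lim_{b\to\infty} \langle \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2), a_1\rangle_{\mathcal{N}^{a,b}(\sigma^a)} = \langle \mathcal{P}^\dagger_{\sigma,\mathcal{N}}(a_2), a_1\rangle_{\mathcal{N}(\sigma)} \tag{53} \]

for all $a_1 \in L(B)$, $a_2 \in L(A)$. After showing that

\[ \lim_{a\to\infty}\lim_{b\to\infty} \langle \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2), a_1\rangle_{\mathcal{N}^{a,b}(\sigma^a)} = \lim_{a\to\infty}\lim_{b\to\infty} \langle \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2), a_1\rangle_{\mathcal{N}(\sigma)} , \tag{54} \]

we can conclude from (53) and (54) that

\[ \lim_{a\to\infty}\lim_{b\to\infty} \langle \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2), a_1\rangle_{\mathcal{N}(\sigma)} = \langle \mathcal{P}^\dagger_{\sigma,\mathcal{N}}(a_2), a_1\rangle_{\mathcal{N}(\sigma)} . \tag{55} \]

From there, we argue that this implies the convergence $\lim_{a\to\infty}\lim_{b\to\infty} \mathcal{P}_{\sigma^a,\mathcal{N}^{a,b}} = \mathcal{P}_{\sigma,\mathcal{N}}$, in the sense
of (43).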

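Thus, we need to establish (52) and (54), and we begin by proving (52). Consider that

\[ \langle a_2, (\mathcal{N}^{a,b})^\dagger(a_1)\rangle_{\sigma^a} = \mathrm{tr}\big(a_2^\dagger (\sigma^a)^{\frac12} (\mathcal{N}^{a,b})^\dagger(a_1) (\sigma^a)^{\frac12}\big) . \tag{56} \]

Then to prove (52), we need to show that the following limit holds for all bounded $a_1$ and $a_2$:

\[ \lim_{a\to\infty}\lim_{b\to\infty} \Big| \mathrm{tr}\big(a_2^\dagger (\sigma^a)^{\frac12} (\mathcal{N}^{a,b})^\dagger(a_1) (\sigma^a)^{\frac12}\big) - \mathrm{tr}\big(a_2^\dagger \sigma^{\frac12} \mathcal{N}^\dagger(a_1) \sigma^{\frac12}\big) \Big| = 0 . \tag{57} \]

From the fact that any bounded square operator can be written as a sum of a real part and an imaginary
part, each of which in turn can be written as a sum of a positive and negative part [23, Chapter 2], it
suffices to prove the limit in (57) for positive semi-definite $a_1$ and $a_2$. So we take $a_1$ and $a_2$ positive
semi-definite in what follows. Consider that

\[ \Big| \mathrm{tr}\big(a_2 (\sigma^a)^{\frac12} (\mathcal{N}^{a,b})^\dagger(a_1) (\sigma^a)^{\frac12}\big) - \mathrm{tr}\big(a_2 \sigma^{\frac12} \mathcal{N}^\dagger(a_1) \sigma^{\frac12}\big) \Big| \]
\[ \leq \Big| \mathrm{tr}\big(a_2 \big[(\sigma^a)^{\frac12} (\mathcal{N}^{a,b})^\dagger(a_1) (\sigma^a)^{\frac12} - (\sigma^a)^{\frac12} \mathcal{N}^\dagger(a_1) (\sigma^a)^{\frac12}\big]\big) \Big| + \Big| \mathrm{tr}\big(a_2 \big[(\sigma^a)^{\frac12} \mathcal{N}^\dagger(a_1) (\sigma^a)^{\frac12} - \sigma^{\frac12} \mathcal{N}^\dagger(a_1) \sigma^{\frac12}\big]\big) \Big| . \tag{58} \]

We begin with the first term on the right of (58):

\[ \Big| \mathrm{tr}\big(a_2 \big[(\sigma^a)^{\frac12} (\mathcal{N}^{a,b})^\dagger(a_1) (\sigma^a)^{\frac12} - (\sigma^a)^{\frac12} \mathcal{N}^\dagger(a_1) (\sigma^a)^{\frac12}\big]\big) \Big| \]
\[ = \Big| \mathrm{tr}\big( \big(\mathcal{N}^{a,b}\big[(\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12}\big] - \mathcal{N}\big[(\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12}\big]\big)\, a_1 \big) \Big| \tag{59} \]
\[ \leq \Big\| \mathcal{N}^{a,b}\big[(\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12}\big] - \mathcal{N}\big[(\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12}\big] \Big\|_1 \|a_1\|_\infty , \tag{60} \]

where we applied the Hölder inequality in the last line. So now we focus on bounding the trace norm
above. Consider that $(\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12}$ is a trace-class operator because

\[ \big\|(\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12}\big\|_1 \leq \big\|(\sigma^a)^{\frac12}\big\|_2\, \|a_2\|_\infty\, \big\|(\sigma^a)^{\frac12}\big\|_2 \tag{61} \]
\[ = \|\sigma^a\|_1 \|a_2\|_\infty \tag{62} \]
\[ \leq \|\sigma\|_1 \|a_2\|_\infty . \tag{63} \]

So then

\[ \Big\| \mathcal{N}^{a,b}\big[(\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12}\big] - \mathcal{N}\big[(\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12}\big] \Big\|_1 \]
\[ = \Big\| \Pi^b \mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big] \Pi^b - \mathcal{N}\big[(\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12}\big] \Big\|_1 \tag{64} \]
\[ \leq \Big\| \Pi^b \mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big] \Pi^b - \mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big] \Big\|_1 + \Big\| \mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big] - \mathcal{N}\big[(\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12}\big] \Big\|_1 . \tag{65} \]

We now bound the last two terms. Let

\[ \omega^a := \frac{\mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big]}{\mathrm{tr}\big(\mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big]\big)} . \tag{66} \]

Then the first term is bounded as

\[ \Big\| \Pi^b \mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big] \Pi^b - \mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big] \Big\|_1 \]
\[ = \mathrm{tr}\big(\mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big]\big)\, \big\|\Pi^b \omega^a \Pi^b - \omega^a\big\|_1 \tag{67} \]
\[ \leq \mathrm{tr}\big(\mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big]\big)\, 2\sqrt{1 - \mathrm{tr}(\Pi^b \omega^a)} , \tag{68} \]

where the last line follows from a standard result (see, e.g., [42, Lemma A.2]). Taking the limit as
$b \to \infty$ and as $a \to \infty$ then makes this term arbitrarily small. For the other term, consider that

\[ \Big\| \mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big] - \mathcal{N}\big[(\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12}\big] \Big\|_1 \leq \Big\| \Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a - (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Big\|_1 = 0 , \tag{69} \]

where the last equality follows because $\Pi^a (\sigma^a)^{\frac12} = (\sigma^a)^{\frac12}$.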
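Now we handle the second term on the right in (58):

\[ \Big| \mathrm{tr}\big(a_2 \big[(\sigma^a)^{\frac12} \mathcal{N}^\dagger(a_1) (\sigma^a)^{\frac12} - \sigma^{\frac12} \mathcal{N}^\dagger(a_1) \sigma^{\frac12}\big]\big) \Big| \]
\[ \leq \Big| \mathrm{tr}\big(a_2 \big[(\sigma^a)^{\frac12} \mathcal{N}^\dagger(a_1) (\sigma^a)^{\frac12} - (\sigma^a)^{\frac12} \mathcal{N}^\dagger(a_1) \sigma^{\frac12}\big]\big) \Big| + \Big| \mathrm{tr}\big(a_2 \big[(\sigma^a)^{\frac12} \mathcal{N}^\dagger(a_1) \sigma^{\frac12} - \sigma^{\frac12} \mathcal{N}^\dagger(a_1) \sigma^{\frac12}\big]\big) \Big| . \tag{70} \]

The first term on the right in (70) is bounded as follows:

\[ \Big| \mathrm{tr}\big(a_2 \big[(\sigma^a)^{\frac12} \mathcal{N}^\dagger(a_1) (\sigma^a)^{\frac12} - (\sigma^a)^{\frac12} \mathcal{N}^\dagger(a_1) \sigma^{\frac12}\big]\big) \Big| = \Big| \mathrm{tr}\big(a_2 (\sigma^a)^{\frac12} \mathcal{N}^\dagger(a_1) \big[(\sigma^a)^{\frac12} - \sigma^{\frac12}\big]\big) \Big| \tag{71} \]
\[ \leq \Big[ \mathrm{tr}\big(a_2^2\, \sigma^a\big)\, \mathrm{tr}\big( [\mathcal{N}^\dagger(a_1)]^2 \big((\sigma^a)^{\frac12} - \sigma^{\frac12}\big)^2 \big) \Big]^{\frac12} \tag{72} \]
\[ \leq \Big[ \|a_2\|_\infty^2\, \|a_1\|_\infty^2\, \mathrm{tr}(\sigma^a)\, \mathrm{tr}\big( \big((\sigma^a)^{\frac12} - \sigma^{\frac12}\big)^2 \big) \Big]^{\frac12} \tag{73} \]
\[ \leq \|a_2\|_\infty \|a_1\|_\infty \|\sigma^a\|_1^{\frac12}\, \big\|(\sigma^a)^{\frac12} - \sigma^{\frac12}\big\|_2 \tag{74} \]
\[ \leq \|a_2\|_\infty \|a_1\|_\infty \|\sigma\|_1^{\frac12}\, \big[\|\sigma^a - \sigma\|_1\big]^{\frac12} . \tag{75} \]

The first inequality follows from the Cauchy-Schwarz inequality. The last inequality is a consequence
of the Powers-Størmer inequality. By essentially the same reasoning, we obtain the following bound on
the other term on the right in (70):

\[ \Big| \mathrm{tr}\big(a_2 \big[(\sigma^a)^{\frac12} \mathcal{N}^\dagger(a_1) \sigma^{\frac12} - \sigma^{\frac12} \mathcal{N}^\dagger(a_1) \sigma^{\frac12}\big]\big) \Big| \leq \|a_2\|_\infty \|a_1\|_\infty \|\sigma\|_1^{\frac12}\, \big[\|\sigma^a - \sigma\|_1\big]^{\frac12} . \tag{76} \]

Putting everything together, we find that

\[ \Big| \mathrm{tr}\big(a_2 (\sigma^a)^{\frac12} (\mathcal{N}^{a,b})^\dagger(a_1) (\sigma^a)^{\frac12}\big) - \mathrm{tr}\big(a_2 \sigma^{\frac12} \mathcal{N}^\dagger(a_1) \sigma^{\frac12}\big) \Big| \]
\[ \leq \mathrm{tr}\big(\mathcal{N}\big[\Pi^a (\sigma^a)^{\frac12} a_2 (\sigma^a)^{\frac12} \Pi^a\big]\big)\, 2\sqrt{1 - \mathrm{tr}(\Pi^b \omega^a)}\; \|a_1\|_\infty + 2 \|a_2\|_\infty \|a_1\|_\infty \|\sigma\|_1^{\frac12}\, \big[\|\sigma^a - \sigma\|_1\big]^{\frac12} , \tag{77} \]

from which we can conclude (52).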
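The Powers-Størmer step used in (75) can be spot-checked for commuting operators. The sketch below assumes the inequality in the form $\|\sqrt{A} - \sqrt{B}\|_2^2 \leq \|A - B\|_1$ for positive semi-definite $A$ and $B$, restricted here to diagonal operators with randomly drawn non-negative entries (hypothetical test data, not taken from the paper).

```python
import math
import random

def sqrt_gap_squared(a, b):
    """|| sqrt(A) - sqrt(B) ||_2^2 for diagonal A, B with non-negative entries."""
    return sum((math.sqrt(x) - math.sqrt(y)) ** 2 for x, y in zip(a, b))

def trace_norm_distance(a, b):
    """|| A - B ||_1 for diagonal A, B."""
    return sum(abs(x - y) for x, y in zip(a, b))

rng = random.Random(1)
worst_margin = math.inf
for _ in range(1000):
    a = [rng.random() for _ in range(5)]
    b = [rng.random() for _ in range(5)]
    worst_margin = min(worst_margin, trace_norm_distance(a, b) - sqrt_gap_squared(a, b))

print(worst_margin >= 0.0)   # no sampled pair violates the inequality
```

In the diagonal case the bound holds entry by entry, since $(\sqrt{x} - \sqrt{y})^2 \leq |\sqrt{x} - \sqrt{y}|(\sqrt{x} + \sqrt{y}) = |x - y|$.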

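We now prove (54), by a reasoning that is very similar to the above. Consider that

\[ \Big| \langle \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2), a_1\rangle_{\mathcal{N}^{a,b}(\sigma^a)} - \langle \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2), a_1\rangle_{\mathcal{N}(\sigma)} \Big| \]
\[ = \Big| \mathrm{tr}\Big( \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2) \Big[ \mathcal{N}^{a,b}(\sigma^a)^{\frac12} a_1\, \mathcal{N}^{a,b}(\sigma^a)^{\frac12} - \mathcal{N}(\sigma)^{\frac12} a_1\, \mathcal{N}(\sigma)^{\frac12} \Big] \Big) \Big| \tag{78} \]
\[ \leq \Big| \mathrm{tr}\Big( \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2)\, \mathcal{N}^{a,b}(\sigma^a)^{\frac12} a_1 \Big[ \mathcal{N}^{a,b}(\sigma^a)^{\frac12} - \mathcal{N}(\sigma)^{\frac12} \Big] \Big) \Big| + \Big| \mathrm{tr}\Big( \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2) \Big[ \mathcal{N}^{a,b}(\sigma^a)^{\frac12} - \mathcal{N}(\sigma)^{\frac12} \Big] a_1\, \mathcal{N}(\sigma)^{\frac12} \Big) \Big| . \tag{79} \]

By the same methods as given above, we find that

\[ \Big| \mathrm{tr}\Big( \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2)\, \mathcal{N}^{a,b}(\sigma^a)^{\frac12} a_1 \Big[ \mathcal{N}^{a,b}(\sigma^a)^{\frac12} - \mathcal{N}(\sigma)^{\frac12} \Big] \Big) \Big| \leq \|a_2\|_\infty \|a_1\|_\infty \|\sigma\|_1^{\frac12} \Big[ \big\|\mathcal{N}^{a,b}(\sigma^a) - \mathcal{N}(\sigma)\big\|_1 \Big]^{\frac12} \tag{80} \]

and

\[ \Big| \mathrm{tr}\Big( \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2) \Big[ \mathcal{N}^{a,b}(\sigma^a)^{\frac12} - \mathcal{N}(\sigma)^{\frac12} \Big] a_1\, \mathcal{N}(\sigma)^{\frac12} \Big) \Big| \leq \|a_2\|_\infty \|a_1\|_\infty \|\sigma\|_1^{\frac12} \Big[ \big\|\mathcal{N}^{a,b}(\sigma^a) - \mathcal{N}(\sigma)\big\|_1 \Big]^{\frac12} . \tag{81} \]

Finally, consider that

\[ \big\|\mathcal{N}^{a,b}(\sigma^a) - \mathcal{N}(\sigma)\big\|_1 \leq \big\|\mathcal{N}^{a,b}(\sigma^a) - \mathcal{N}^{a,b}(\sigma)\big\|_1 + \big\|\mathcal{N}^{a,b}(\sigma) - \mathcal{N}(\sigma)\big\|_1 \tag{82} \]
\[ \leq \|\sigma^a - \sigma\|_1 + \big\|\Pi^b \mathcal{N}(\Pi^a \sigma \Pi^a) \Pi^b - \mathcal{N}(\Pi^a \sigma \Pi^a)\big\|_1 + \big\|\mathcal{N}(\Pi^a \sigma \Pi^a) - \mathcal{N}(\sigma)\big\|_1 \tag{83} \]
\[ \leq \|\sigma^a - \sigma\|_1 + \big\|\Pi^b \mathcal{N}(\Pi^a \sigma \Pi^a) \Pi^b - \mathcal{N}(\Pi^a \sigma \Pi^a)\big\|_1 + \|\Pi^a \sigma \Pi^a - \sigma\|_1 , \tag{84} \]

which also converges to zero in the limit as $b \to \infty$ and as $a \to \infty$. Putting everything together, we find that

\[ \Big| \langle \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2), a_1\rangle_{\mathcal{N}^{a,b}(\sigma^a)} - \langle \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(a_2), a_1\rangle_{\mathcal{N}(\sigma)} \Big| \]
\[ \leq 2 \|a_2\|_\infty \|a_1\|_\infty \|\sigma\|_1^{\frac12} \Big[ 2\|\sigma^a - \sigma\|_1 + \big\|\Pi^b \mathcal{N}(\Pi^a \sigma \Pi^a) \Pi^b - \mathcal{N}(\Pi^a \sigma \Pi^a)\big\|_1 \Big]^{\frac12} , \tag{85} \]

which allows us to conclude (54), and in turn, (55).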

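It remains to argue for the convergence $\lim_{a\to\infty}\lim_{b\to\infty} \mathcal{P}_{\sigma^a,\mathcal{N}^{a,b}} = \mathcal{P}_{\sigma,\mathcal{N}}$, in the sense of (43).
Consider that (55) implies that

\[ \lim_{a\to\infty}\lim_{b\to\infty} \langle\phi'|\,\mathcal{N}(\sigma)^{\frac12}\, \mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}(|\phi\rangle\langle\psi|)\, \mathcal{N}(\sigma)^{\frac12}\,|\psi'\rangle = \langle\phi'|\,\mathcal{N}(\sigma)^{\frac12}\, \mathcal{P}^\dagger_{\sigma,\mathcal{N}}(|\phi\rangle\langle\psi|)\, \mathcal{N}(\sigma)^{\frac12}\,|\psi'\rangle , \tag{86} \]

for all $|\phi\rangle, |\psi\rangle \in A$ and $|\phi'\rangle, |\psi'\rangle \in B$. This establishes convergence of $\mathcal{P}^\dagger_{\sigma^a,\mathcal{N}^{a,b}}$ to $\mathcal{P}^\dagger_{\sigma,\mathcal{N}}$ for all
$|\phi\rangle, |\psi\rangle \in A$ and $|\phi'\rangle, |\psi'\rangle \in \mathrm{supp}(\mathcal{N}(\sigma))$, which in turn allows us to conclude convergence of
$\mathcal{P}_{\sigma^a,\mathcal{N}^{a,b}}$ to $\mathcal{P}_{\sigma,\mathcal{N}}$. □

Before stating the following lemma, we introduce the shorthand

\[ \mathcal{U}_{\omega,t} : X \mapsto \omega^{\mathrm{i}t} X \omega^{-\mathrm{i}t} , \tag{87} \]

where $\omega$ is a positive semi-definite operator.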

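Lemma 3.8. For any $\sigma \in P(A)$, any $\rho \in S_\sigma(A)$, any $\mathcal{N} \in \mathrm{TPCP}(A, B)$, and all $t \in \mathbb{R}$, the following
limit holds

\[ \lim_{a\to\infty}\lim_{b\to\infty} F\big(\rho^a, (\mathcal{R}^t_{\sigma^a,\mathcal{N}^{a,b}}\circ\mathcal{N}^{a,b})(\rho^a)\big) = F\big(\rho, (\mathcal{R}^t_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho)\big) . \tag{88} \]

Proof. Consider that we have the following convergences:

\[ \lim_{a\to\infty}\lim_{b\to\infty} \mathcal{U}_{\mathcal{N}^{a,b}(\sigma^a),t} = \mathcal{U}_{\mathcal{N}(\sigma),t} \quad \text{and} \quad \lim_{a\to\infty} \mathcal{U}_{\sigma^a,t} = \mathcal{U}_{\sigma,t} , \tag{89} \]

each understood to be in the weak sense in (43). The serial concatenation of weakly converging channels
converges to the serial concatenation of the limit, so that
$\lim_{a\to\infty}\lim_{b\to\infty} \mathcal{R}^t_{\sigma^a,\mathcal{N}^{a,b}}\circ\mathcal{N}^{a,b} = \mathcal{R}^t_{\sigma,\mathcal{N}}\circ\mathcal{N}$, where we used the above and Lemma 3.7. Then we
can conclude that the fidelity converges because it is continuous in its inputs (see, e.g., [11, Lemma B.9]),
and for our case considered here, convergence in the weak sense implies convergence in the trace norm
[19, Chapter 11]. This proves the assertion. □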

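By invoking the stronger version of Theorem 2.1 for finite-dimensional Hilbert spaces (that has been
proven in Step 1)^1 together with Lemmas 3.6 and 3.8, the fact that any separable Hilbert space is
isomorphic to $\ell^2$ [37, Theorem II.7], and the dominated convergence theorem, we find for any $\rho \in S_\sigma(A)$

\[ 0 \leq \limsup_{a\to\infty}\, \limsup_{b\to\infty}\, \Big[ D(\rho^a\|\sigma^a) - D\big(\mathcal{N}^{a,b}(\rho^a)\|\mathcal{N}^{a,b}(\sigma^a)\big) + 2\int_{\mathbb{R}} \mathrm{d}t\, \beta_0(t) \log F\big(\rho^a, (\mathcal{R}^{t/2}_{\sigma^a,\mathcal{N}^{a,b}}\circ\mathcal{N}^{a,b})(\rho^a)\big) \Big] \tag{90} \]
\[ \leq D(\rho\|\sigma) - D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) + 2\int_{\mathbb{R}} \mathrm{d}t\, \beta_0(t) \log F\big(\rho, (\mathcal{R}^{t/2}_{\sigma,\mathcal{N}}\circ\mathcal{N})(\rho)\big) . \tag{91} \]

This establishes the stronger version of Theorem 2.1 in (23). We can then apply concavity of the
logarithm and the fidelity to conclude Theorem 2.1.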

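^1 Recall that by Remark 3.5, Theorem 2.1 remains valid for subnormalized states and a trace non-increasing
completely positive map.

4 Proof of Theorem 2.2


For this proof we first introduce some notation. For any $H \in P(A)$, where $P(A)$ denotes the set of
non-negative operators on $A$ and where $A$ is a finite-dimensional Hilbert space, let $H = \sum_{k\in[d]} \lambda_k P_k$ be
an eigenvalue decomposition of $H$, where $d \leq \dim(A)$ and $\{P_k\}_{k\in[d]}$ are mutually orthogonal projectors.
For $\varphi = (\varphi_1, \ldots, \varphi_d) \in [0, 2\pi]^{\times d}$ we define the unitary

\[ U^\varphi_H := \sum_{k\in[d]} \exp(\mathrm{i}\varphi_k) P_k . \tag{92} \]

With the help of these unitaries we define yet another rotated Petz recovery map

\[ \mathcal{T}^{\vartheta,\varphi}_{\sigma,\mathcal{N}} : X_B \mapsto U^\vartheta_\sigma\, \mathcal{P}_{\sigma,\mathcal{N}}\big(U^\varphi_{\mathcal{N}(\sigma)} X_B\, U^{\varphi\dagger}_{\mathcal{N}(\sigma)}\big)\, U^{\vartheta\dagger}_\sigma . \tag{93} \]

For any $\sigma \in P(A)$ and $\mathcal{N} \in \mathrm{TPCP}(A, B)$ we denote the convex hull of rotated Petz recovery maps by

\[ \mathrm{T}_{\sigma,\mathcal{N}} := \mathrm{conv}\big\{ \mathcal{T}^{\vartheta,\varphi}_{\sigma,\mathcal{N}} : \vartheta \in [0, 2\pi]^{\times d_1},\ \varphi \in [0, 2\pi]^{\times d_2} \big\} , \tag{94} \]

where $d_1 \leq \dim(A)$ and $d_2 \leq \dim(B)$. We note that the rotated Petz recovery map $\mathcal{T}^{\vartheta,\varphi}_{\sigma,\mathcal{N}}$ is trace
non-increasing and completely positive. We further note that clearly $\mathrm{T}_{\sigma,\mathcal{N}} \subseteq \mathrm{R}_{\sigma,\mathcal{N}}$ with $\mathrm{R}_{\sigma,\mathcal{N}}$ defined
in (16).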


Step 1: Proof for a finite set of states 

For any a G P(A) consider a finite set of density operators with G Sct(^) for all a; £ A" and 

\A\ < 00 . Furthermore let z/ be a probability measure on X. For any channel Af G TPCP(A, B) we 
define the following non-negative operators 

pxA i'(x)lxXxlx ^ and ctata i^(a:)|a;)(x|x ® cr = px ® o'. (95) 

xGX 


This then gives 

[(Ix ®Af){axA)]~^ = [{px ( 8 )^( 0 -))] "" = Px^ ®Af{a)~^ . (96) 

Since (Tx ® A/’)’^ = Tx ® Af"^ and = (px 'S> cr)i = p^ 0 cr^, the Petz recovery map for ax a and 
Px < 8 > Af can be written as 

Yxb TaxA,ix»M (Yxb) = ^xA^^x ® Af)^ {[{Px ® Af){ax a)\~Yxb [{Px <8) A/’)(crxA)]~^) ^xx 

= {Px ® ^^){Px ®Af''){{px^ ®Af{a)~^)YxB{Px^ ® f^{^)~^)){Px ® 

= (nsupp(!y) ®Tc,m){Yxb) , (97) 


where nsupp(j,) denotes a projector onto the support of v. Furthermore, we have that for any f) G 
[0,27r]^'^i and for any <p £ [0,27r]^'^2 there exist d,d G [0,27r]^‘^i and (p,Cp G [0,271]^^^^ such that 


ut 


= Ut 


and 


TJV 

(naupp(,,) 0M}(crxA) 


= U'^ 

^ Px 




Theorem 3.3 of [43] ensures that 


(98) 


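$$\begin{aligned}
& D(\rho_{XA}\|\sigma_{XA}) - D\big((\Pi_{\mathrm{supp}(\nu)} \otimes \mathcal{N})(\rho_{XA}) \,\big\|\, (\Pi_{\mathrm{supp}(\nu)} \otimes \mathcal{N})(\sigma_{XA})\big) \\
&\quad - \min\Big\{ D_{\mathbb{M}}\big(\rho_{XA} \,\big\|\, (\mathcal{T}_{\sigma_{XA},\Pi_{\mathrm{supp}(\nu)}\otimes\mathcal{N}} \circ (\Pi_{\mathrm{supp}(\nu)} \otimes \mathcal{N}))(\rho_{XA})\big) : \mathcal{T}_{\sigma_{XA},\Pi_{\mathrm{supp}(\nu)}\otimes\mathcal{N}} \in \mathrm{T}_{\sigma_{XA},\Pi_{\mathrm{supp}(\nu)}\otimes\mathcal{N}} \Big\} \ge 0 . \qquad (99)
\end{aligned}$$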
Combining (95), (96), (97), and (98) shows that for any d G [0,27r]^‘^^ and for any ip G [0, 27r]^'^2 there 
exist d G [0, 27r]^'^i and G [0, 27r]^'^2 such that 

'^x!,n,,pp(,,)®AA(PxA) = Hx)\x){x\x ® 7 ;^;^(p") • (100) 

x&X 

This, together with a simple property of the relative and the measured relative entropy ensuring that 
they decompose for orthogonal states (see Lemmas B.2 and B.3 in [42]), implies that (99) can be 
rewritten as 

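$$\max_{\mathcal{T}_{\sigma,\mathcal{N}} \in \mathrm{T}_{\sigma,\mathcal{N}}} \sum_{x\in X} \nu(x) \Big( D(\rho^x\|\sigma) - D(\mathcal{N}(\rho^x)\|\mathcal{N}(\sigma)) - D_{\mathbb{M}}\big(\rho^x \,\big\|\, (\mathcal{T}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho^x)\big) \Big) \ge 0 . \qquad (101)$$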

We note that the set $\mathrm{T}_{\sigma,\mathcal{N}}$ (defined in Equation (16)) is compact since $\mathrm{TPCP}(B, A)$ is compact (see [42,
Remark C.3]) and since the intersection of a compact set with a closed set is compact. Furthermore,
the mapping $\mathrm{T}_{\sigma,\mathcal{N}} \ni \mathcal{T}_{\sigma,\mathcal{N}} \mapsto D_{\mathbb{M}}(\rho^x \| (\mathcal{T}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho^x)) \in \mathbb{R}_+$ is lower semicontinuous (see proof of
Lemma 4.2). By the extreme value theorem this implies that the maximum in (101) is attained.

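Since (101) is valid for any probability measure $\nu$ on $X$ we obtain

$$\min_{\nu \in \mathbb{P}(X)}\ \max_{\mathcal{T}_{\sigma,\mathcal{N}} \in \mathrm{T}_{\sigma,\mathcal{N}}} \sum_{x\in X} \nu(x) \Big( D(\rho^x\|\sigma) - D(\mathcal{N}(\rho^x)\|\mathcal{N}(\sigma)) - D_{\mathbb{M}}\big(\rho^x \,\big\|\, (\mathcal{T}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho^x)\big) \Big) \ge 0 , \qquad (102)$$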

where $\mathbb{P}(X)$ denotes the set of probability measures on $X$. We note that the minimum is attained. To see
this, we first remark that the set $X$ is clearly compact and as a result, the set $\mathbb{P}(X)$ is weak* compact [1,
Theorem 15.11]. Furthermore, the function
$\mathbb{P}(X) \ni \nu \mapsto \sum_{x\in X} \nu(x) \big[ D(\rho^x\|\sigma) - D(\mathcal{N}(\rho^x)\|\mathcal{N}(\sigma)) - D_{\mathbb{M}}(\rho^x \| (\mathcal{T}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho^x)) \big] \in \mathbb{R}$
is continuous with respect to the weak* topology, since expectation values
are continuous in the weak* topology. The extreme value theorem thus ensures that the minimum is
attained.

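To simplify notation, let

$$f(\nu, \mathcal{T}_{\sigma,\mathcal{N}}) := \sum_{x\in X} \nu(x) \Big( D(\rho^x\|\sigma) - D(\mathcal{N}(\rho^x)\|\mathcal{N}(\sigma)) - D_{\mathbb{M}}\big(\rho^x \,\big\|\, (\mathcal{T}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho^x)\big) \Big) . \qquad (103)$$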

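For any $\mathcal{T}_{\sigma,\mathcal{N}} \in \mathrm{T}_{\sigma,\mathcal{N}}$ the function $\mathbb{P}(X) \ni \nu \mapsto f(\nu, \mathcal{T}_{\sigma,\mathcal{N}}) \in \mathbb{R}$ is linear and bounded (since $\rho^x \in \mathrm{S}_\sigma(A)$
for all $x \in X$) and hence continuous. Furthermore, for any $\nu \in \mathbb{P}(X)$ the function
$\mathrm{T}_{\sigma,\mathcal{N}} \ni \mathcal{T}_{\sigma,\mathcal{N}} \mapsto f(\nu, \mathcal{T}_{\sigma,\mathcal{N}}) \in \mathbb{R}$ is concave [42, Lemma B.4] and upper semicontinuous (as shown in the proof of
Lemma 4.2). The sets $\mathbb{P}(X)$ and $\mathrm{T}_{\sigma,\mathcal{N}}$ are both convex and as discussed above $\mathrm{T}_{\sigma,\mathcal{N}}$ is compact. As a
result, Sion's minimax theorem [41, Corollary 3.3] implies that

$$\max_{\mathcal{T}_{\sigma,\mathcal{N}} \in \mathrm{T}_{\sigma,\mathcal{N}}}\ \min_{\nu \in \mathbb{P}(X)} \sum_{x\in X} \nu(x) \Big( D(\rho^x\|\sigma) - D(\mathcal{N}(\rho^x)\|\mathcal{N}(\sigma)) - D_{\mathbb{M}}\big(\rho^x \,\big\|\, (\mathcal{T}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho^x)\big) \Big) \ge 0 . \qquad (104)$$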

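This shows that there exists a recovery map $\mathcal{T}_{\sigma,\mathcal{N}} \in \mathrm{T}_{\sigma,\mathcal{N}}$ that satisfies

$$D(\rho^x\|\sigma) - D(\mathcal{N}(\rho^x)\|\mathcal{N}(\sigma)) \ge D_{\mathbb{M}}\big(\rho^x \,\big\|\, (\mathcal{T}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho^x)\big) \quad \text{for all } x \in X . \qquad (105)$$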

Since R^.At C T^X) this proves the statement of Theorem 2.2 for a finite set of states {p^}xex- 
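The passage from an averaged inequality to one that holds for every $x$ rests on the fact that a linear function on the probability simplex attains its minimum at a point mass. The following Python sketch checks this on a grid for a fixed recovery map. The list `gaps` stands for the per-state differences between the entropy decrease and the measured relative entropy term; its numerical values, the grid resolution, and all function names are hypothetical and chosen only for illustration.

```python
def linear_objective(nu, gaps):
    """Evaluate sum_x nu(x) * gap_x, which is linear in the measure nu."""
    return sum(weight * gap for weight, gap in zip(nu, gaps))


def simplex_grid(dimension, steps):
    """Yield all probability vectors whose entries are multiples of 1/steps."""
    def compositions(remaining, parts):
        if parts == 1:
            yield (remaining,)
            return
        for first in range(remaining + 1):
            for rest in compositions(remaining - first, parts - 1):
                yield (first,) + rest

    for counts in compositions(steps, dimension):
        yield tuple(count / steps for count in counts)


# Hypothetical per-state gaps for one fixed recovery map.
gaps = [0.7, 0.1, 0.4]

# Minimizing over the grid reproduces the smallest single gap, which is
# attained at the point mass on the corresponding state.
grid_minimum = min(linear_objective(nu, gaps) for nu in simplex_grid(3, 20))
```

If the minimum over all measures is non-negative, then so is each individual gap, which is the form of the statement needed for every state in the finite set.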

Step 2: Extensions to an infinite set of states 

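We now show how to extend the statement proven in Step 1 for a finite set of states to the set of all
states. Let $\mathrm{S} := \mathrm{S}(A)$ denote the set of density operators on $A$. Furthermore, for $\mathcal{T} \in \mathrm{TPCP}(B, A)$,
$\sigma \in \mathrm{P}(A)$, and $\mathcal{N} \in \mathrm{TPCP}(A, B)$ we define the function family

$$\Delta_{\mathcal{T},\sigma,\mathcal{N}} : \mathrm{S}_\sigma(A) \to \mathbb{R} \cup \{-\infty\} , \qquad \rho \mapsto D(\rho\|\sigma) - D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) - D_{\mathbb{M}}\big(\rho \,\big\|\, (\mathcal{T} \circ \mathcal{N})(\rho)\big) . \qquad (106)$$

We start by proving two regularity properties of the $\Delta$-function family defined above.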
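Lemma 4.1. For any $\mathcal{T} \in \mathrm{TPCP}(B, A)$, any $\sigma \in \mathrm{P}(A)$, and any $\mathcal{N} \in \mathrm{TPCP}(A, B)$, the function
$\mathrm{S}_\sigma(A) \ni \rho \mapsto \Delta_{\mathcal{T},\sigma,\mathcal{N}}(\rho) \in \mathbb{R}$ defined in (106) is upper semicontinuous.

Proof. For any fixed $\sigma \in \mathrm{P}(A)$, the function $\mathrm{S}_\sigma(A) \ni \rho \mapsto D(\rho\|\sigma) = \mathrm{tr}(\rho(\log\rho - \log\sigma)) = -H(\rho) - \mathrm{tr}(\rho\log\sigma)$
is continuous. This follows from Fannes' inequality [10], which shows that $\mathrm{S}(A) \ni \rho \mapsto H(\rho) \in \mathbb{R}$
is continuous. Furthermore, the function $\mathrm{S}_\sigma(A) \ni \rho \mapsto -\mathrm{tr}(\rho\log\sigma) \in \mathbb{R}$ is bounded and linear and thus
continuous. Since $\mathrm{S}(A) \ni \rho \mapsto \mathcal{N}(\rho)$ is linear and bounded it is continuous, and thus the function
$\mathrm{S}_\sigma(A) \ni \rho \mapsto D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma))$ is also continuous. In addition, the function $\mathrm{S}_\sigma(A) \ni \rho \mapsto D_{\mathbb{M}}(\rho\|(\mathcal{T} \circ \mathcal{N})(\rho))$ is
lower semicontinuous [42, Lemma 3.2], which then proves the assertion. □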

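Lemma 4.2. For any $\sigma \in \mathrm{S}(A)$, any $\rho \in \mathrm{S}_\sigma(A)$, and any $\mathcal{N} \in \mathrm{TPCP}(A, B)$, the function
$\mathrm{TPCP}(B, A) \ni \mathcal{T} \mapsto \Delta_{\mathcal{T},\sigma,\mathcal{N}}(\rho) \in \mathbb{R}$ defined in (106) is upper semicontinuous.

Proof. The assertion follows from the fact that the mapping $\mathrm{T}_{\sigma,\mathcal{N}} \ni \mathcal{T}_{\sigma,\mathcal{N}} \mapsto D_{\mathbb{M}}(\rho\|(\mathcal{T}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho)) \in \mathbb{R}_+$
is lower semicontinuous. To see this, we first note that by definition we have
$D_{\mathbb{M}}(\rho\|(\mathcal{T}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho)) = \sup_{\mathcal{M} \in \mathbb{M}} D(\mathcal{M}(\rho)\|\mathcal{M}((\mathcal{T}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho)))$.
Since $\mathcal{T}_{\sigma,\mathcal{N}}$, $\mathcal{N}$, and $\mathcal{M}$ are linear bounded operators and hence
continuous we find that $\mathcal{T}_{\sigma,\mathcal{N}} \mapsto D(\mathcal{M}(\rho)\|\mathcal{M}((\mathcal{T}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho)))$ is continuous as the logarithm
$\mathbb{R}_+ \ni x \mapsto \log x \in \mathbb{R}$ is continuous. As the supremum of continuous functions is lower semicontinuous [6,
Chapter IV, Section 6.2, Theorem 4], the assertion follows. □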

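Since $\mathrm{S}$ is compact (see, e.g., [42, Lemma C.1]), it follows that for any $\varepsilon > 0$ there exists a finite set
$\mathrm{S}^{\varepsilon}$ of density operators on $A$ such that any $\rho \in \mathrm{S}$ is $\varepsilon$-close to an element of $\mathrm{S}^{\varepsilon}$. We assume without
loss of generality that $\mathrm{S}^{\varepsilon'} \subseteq \mathrm{S}^{\varepsilon}$ for $\varepsilon' \ge \varepsilon$. Let $\mathcal{T}^{\varepsilon} \in \mathrm{T}_{\sigma,\mathcal{N}}$ be such that $\inf_{\rho \in \mathrm{S}^{\varepsilon}} \Delta_{\mathcal{T}^{\varepsilon},\sigma,\mathcal{N}}(\rho) \ge 0$, whose
existence has been established in Step 1. The set $\mathrm{T}_{\sigma,\mathcal{N}}$ is compact because $\mathrm{TPCP}(B, A)$ is compact [42,
Remark C.3] and the intersection of a compact set with a closed set is compact. Since $\mathrm{T}_{\sigma,\mathcal{N}}$ is compact,
there exists a decreasing sequence $\{\varepsilon_n\}_{n\in\mathbb{N}}$ and $\bar{\mathcal{T}} \in \mathrm{T}_{\sigma,\mathcal{N}}$ such that

$$\lim_{n\to\infty} \varepsilon_n = 0 \quad \text{and} \quad \bar{\mathcal{T}} = \lim_{n\to\infty} \mathcal{T}^{\varepsilon_n} . \qquad (107)$$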

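Thus for all $n \in \mathbb{N}$ we find by Lemma 4.2

$$\begin{aligned}
\inf_{\rho \in \mathrm{S}^{\varepsilon_n}} \Delta_{\bar{\mathcal{T}},\sigma,\mathcal{N}}(\rho)
&\ge \inf_{\rho \in \mathrm{S}^{\varepsilon_n}} \limsup_{m\to\infty} \Delta_{\mathcal{T}^{\varepsilon_m},\sigma,\mathcal{N}}(\rho)
\ge \limsup_{m\to\infty} \inf_{\rho \in \mathrm{S}^{\varepsilon_n}} \Delta_{\mathcal{T}^{\varepsilon_m},\sigma,\mathcal{N}}(\rho) \\
&\ge \limsup_{m\to\infty} \inf_{\rho \in \mathrm{S}^{\varepsilon_m}} \Delta_{\mathcal{T}^{\varepsilon_m},\sigma,\mathcal{N}}(\rho) \ge 0 , \qquad (108)
\end{aligned}$$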

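where the final step follows by Step 1. For any $\rho \in \mathrm{S}$ let $\rho_n \in \mathrm{S}^{\varepsilon_n}$ be such that $\lim_{n\to\infty} \rho_n = \rho$. (By the
definition of $\mathrm{S}^{\varepsilon_n}$, it follows that such a sequence $\{\rho_n\}_{n\in\mathbb{N}}$ with $\rho_n \in \mathrm{S}^{\varepsilon_n}$ always exists.) By Lemma 4.1
and (108) we find for any $\rho \in \mathrm{S}$

$$\Delta_{\bar{\mathcal{T}},\sigma,\mathcal{N}}(\rho) = \Delta_{\bar{\mathcal{T}},\sigma,\mathcal{N}}\big(\lim_{n\to\infty} \rho_n\big) \ge \limsup_{n\to\infty} \Delta_{\bar{\mathcal{T}},\sigma,\mathcal{N}}(\rho_n) \ge \limsup_{n\to\infty} \inf_{\rho \in \mathrm{S}^{\varepsilon_n}} \Delta_{\bar{\mathcal{T}},\sigma,\mathcal{N}}(\rho) \ge 0 . \qquad (109)$$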

Since Rcr, 7 \^ ^ To-.TVA, this proves the first inequality in (21). 

The second inequality in (21) is a consequence of the monotonicity of the Rényi divergence in the
order parameter [8] and of the fact that for any two states there exists an optimal measurement that 
does not increase their fidelity [14, Section 3.3]. This thus concludes the proof of Theorem 2.2 for 
finite-dimensional Hilbert spaces. 


5 Universal refinements of other entropy inequalities 

It is well known (see e.g. [38]) that the monotonicity of the relative entropy under trace-preserving 
completely positive maps is closely related to (i) strong subadditivity, (ii) concavity of the conditional 
entropy, and (iii) joint convexity of the relative entropy. Based on this relation, Theorem 2.2 can be
used to derive universal remainder terms for the statements (i)-(iii). We note that universal means that 
the recovery map that occurs in the remainder term does not depend on all possible parameters. Within
this section we assume that $A$, $B$, and $C$ are finite-dimensional Hilbert spaces. For $n \in \mathbb{N}$, the $n$-simplex
is denoted by $\Delta_n := \{x \in \mathbb{R}^n : x \ge 0, \sum_{i=1}^n x_i = 1\}$. The following three corollaries consist of two

statements each. Statement 1 is derived from Theorem 2.1 which has the advantage that the remainder 
term is explicit (and hence also universal). However, since the lower bound of Theorem 2.1 is in form 
of a logarithm of a fidelity we cannot fully simplify the remainder terms (e.g., the convex sum appears 
inside the logarithm). This is circumvented in Statement 2 (which is derived from Theorem 2.2)^ at the 
cost that the remainder term is not explicit, but still universal. 

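Applying Theorems 2.1 and 2.2 for $\sigma = \mathrm{id}_A \otimes \rho_{BC} \in \mathrm{P}(A \otimes B \otimes C)$, $\rho = \rho_{ABC} \in \mathrm{S}_\sigma(A \otimes B \otimes C)$, and
$\mathcal{N}_{ABC\to AB} = \mathcal{I}_{AB} \otimes \mathrm{tr}_C$ implies a strengthened version of the result that has been established in [42,
Theorem 2.1, Corollary 2.4, and Remark 2.5].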

Corollary 5.1 (Strong subadditivity). 

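1. For any density operator $\rho_{ABC} \in \mathrm{S}(A \otimes B \otimes C)$ we have

$$I(A : C|B)_\rho \ge -2 \log F\big(\rho_{ABC}, \mathcal{R}_{B\to BC}(\rho_{AB})\big) , \qquad (110)$$

where $\mathcal{R}_{B\to BC}(\cdot) = \int_{\mathbb{R}} \mathrm{d}t\, \beta_0(t)\, \mathcal{R}^{t/2}_{\rho_{BC},\mathrm{tr}_C}(\cdot)$ is defined in Theorem 2.1.

2. For any density operator $\rho_{BC} \in \mathrm{S}(B \otimes C)$ there exists a recovery map $\mathcal{R}_{B\to BC}$ inside the set
$\mathrm{R}_{\rho_{BC},\mathrm{tr}_C}$ defined in (16), such that for any extension $\rho_{ABC}$ on $A \otimes B \otimes C$ we have

$$I(A : C|B)_\rho \ge D_{\mathbb{M}}\big(\rho_{ABC} \,\big\|\, \mathcal{R}_{B\to BC}(\rho_{AB})\big) \ge -2 \log F\big(\rho_{ABC}, \mathcal{R}_{B\to BC}(\rho_{AB})\big) . \qquad (111)$$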

From Corollary 5.1 we can deduce a universal remainder term for the concavity of the conditional
entropy. We note that a similar remainder term has been conjectured in [5]. Furthermore, a remainder
term that is, however, neither universal nor explicit has been proven in [45, Corollary 12].

Corollary 5.2 (Concavity of conditional entropy). 

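1. For any $\rho_{AB} \in \mathrm{S}(A \otimes B)$ we have

$$H(A|B)_\rho - \sum_{x\in X} \nu(x)\, H(A|B)_{\rho^x} \ge -2 \log \sum_{x\in X} \nu(x)\, F\big(\rho^x_{AB}, \mathcal{R}_{B\to AB}(\rho^x_B)\big) , \qquad (112)$$

for all ensembles $\{\nu(x), \rho^x_{AB}\}_{x\in X}$ with $\nu \in \Delta_{|X|}$ and $\rho^x_{AB} \in \mathrm{S}(A \otimes B)$ such that $\rho_{AB} = \sum_{x\in X} \nu(x) \rho^x_{AB}$,
where $\mathcal{R}_{B\to AB}(\cdot) = \int_{\mathbb{R}} \mathrm{d}t\, \beta_0(t)\, \mathcal{R}^{t/2}_{\rho_{AB},\mathrm{tr}_A}(\cdot)$ is defined in Theorem 2.1.

2. For any $\rho_{AB} \in \mathrm{S}(A \otimes B)$ there exists a recovery map $\mathcal{R}_{B\to AB} \in \mathrm{R}_{\rho_{AB},\mathrm{tr}_A}$, with $\mathrm{R}_{\rho_{AB},\mathrm{tr}_A}$ defined
in (16), such that

$$\begin{aligned}
H(A|B)_\rho - \sum_{x\in X} \nu(x)\, H(A|B)_{\rho^x}
&\ge \sum_{x\in X} \nu(x)\, D_{\mathbb{M}}\big(\rho^x_{AB} \,\big\|\, \mathcal{R}_{B\to AB}(\rho^x_B)\big) \\
&\ge -2 \sum_{x\in X} \nu(x) \log F\big(\rho^x_{AB}, \mathcal{R}_{B\to AB}(\rho^x_B)\big) , \qquad (113)
\end{aligned}$$

for all ensembles $\{\nu(x), \rho^x_{AB}\}_{x\in X}$ with $\nu \in \Delta_{|X|}$ and $\rho^x_{AB} \in \mathrm{S}(A \otimes B)$ such that $\rho_{AB} = \sum_{x\in X} \nu(x) \rho^x_{AB}$.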

Proof. We will provide a proof for Statement 2. Statement 1 follows by exactly the same reasoning. Let
us consider the following classical-quantum state

$$\phi_{XAB} = \sum_{x\in X} \nu(x)\, |x\rangle\langle x|_X \otimes \rho^x_{AB} . \qquad (114)$$

^Recall that the lower bound in Theorem 2.2 is in the form of a measured relative entropy. 
By Corollary 5.1 and a simple fact ensuring that the measured relative entropy for orthogonal input
states decomposes (see Lemma B.3 in [42]), we find

$$\begin{aligned}
H(A|B)_\rho - \sum_{x\in X} \nu(x)\, H(A|B)_{\rho^x} &= H(A|B)_\phi - H(A|BX)_\phi = I(X : A|B)_\phi \\
&\ge D_{\mathbb{M}}\big(\phi_{XAB} \,\big\|\, \mathcal{R}_{B\to AB}(\phi_{XB})\big) = \sum_{x\in X} \nu(x)\, D_{\mathbb{M}}\big(\rho^x_{AB} \,\big\|\, \mathcal{R}_{B\to AB}(\rho^x_B)\big) . \qquad (115)
\end{aligned}$$

We note that by Corollary 5.1 the recovery map $\mathcal{R}_{B\to AB}$ is universal in the sense that it only depends
on $\rho_{AB}$. The second inequality in (113) is a consequence of the monotonicity of the Rényi divergence
in the order parameter [8] and of the fact that for any two states there exists an optimal measurement
that does not increase their fidelity [14, Section 3.3]. □

Theorems 2.1 and 2.2 imply a universal remainder term for the joint convexity of the relative entropy.
We note that a similar remainder term has been conjectured in [39]. Corollary 13 in [45] proves such a
remainder term, which is, however, neither universal nor explicit.

Corollary 5.3 (Joint convexity of relative entropy). 

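1. For any $\nu \in \Delta_{|X|}$, $\{\sigma^x\}_{x\in X}$ with $\sigma^x \in \mathrm{P}(A)$ for all $x \in X$, $\sigma_{XA} = \sum_{x\in X} \nu(x)\, |x\rangle\langle x|_X \otimes \sigma^x$, any
$\rho^x \in \mathrm{S}_{\sigma^x}(A)$ and $\rho = \sum_{x\in X} \nu(x) \rho^x$ we have

$$\sum_{x\in X} \nu(x)\, D(\rho^x\|\sigma^x) - D(\rho\|\sigma) \ge -2 \log \sum_{x\in X} \nu(x)\, F\big(\rho^x, \mathcal{R}_{\sigma_{XA},\mathrm{tr}_X}(\rho)\big) , \qquad (116)$$

where $\mathcal{R}_{\sigma_{XA},\mathrm{tr}_X}(\cdot) = \int_{\mathbb{R}} \mathrm{d}t\, \beta_0(t)\, \mathcal{R}^{t/2}_{\sigma_{XA},\mathrm{tr}_X}(\cdot)$ is defined in Theorem 2.1.

2. For any $\nu \in \Delta_{|X|}$, $\{\sigma^x\}_{x\in X}$ with $\sigma^x \in \mathrm{P}(A)$ for all $x \in X$, and $\sigma_{XA} = \sum_{x\in X} \nu(x)\, |x\rangle\langle x|_X \otimes \sigma^x$
there exists a recovery map $\mathcal{R} \in \mathrm{R}_{\sigma_{XA},\mathrm{tr}_X}$ such that

$$\sum_{x\in X} \nu(x)\, D(\rho^x\|\sigma^x) - D(\rho\|\sigma) \ge \sum_{x\in X} \nu(x)\, D_{\mathbb{M}}\big(\rho^x \,\big\|\, \mathcal{R}(\rho)\big) \ge -2 \sum_{x\in X} \nu(x) \log F\big(\rho^x, \mathcal{R}(\rho)\big) \qquad (117)$$

for all $\{\rho^x\}_{x\in X}$ with $\rho^x \in \mathrm{S}_{\sigma^x}(A)$ and $\rho = \sum_{x\in X} \nu(x) \rho^x$.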

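Proof. We will provide a proof for Statement 2. Statement 1 follows by exactly the same reasoning. Let
us consider the states

$$\rho_{XA} := \sum_{x\in X} \nu(x)\, |x\rangle\langle x|_X \otimes \rho^x \quad \text{and} \quad \rho = \rho_A = \sum_{x\in X} \nu(x)\, \rho^x . \qquad (118)$$

Since the relative entropy decomposes for orthogonal input states (see Lemma B.2 in [42]), Theorem 2.2
implies that there exists a recovery map $\mathcal{R} \in \mathrm{R}_{\sigma_{XA},\mathrm{tr}_X}$ such that

$$\begin{aligned}
\sum_{x\in X} \nu(x)\, D(\rho^x\|\sigma^x) - D(\rho\|\sigma) &= D(\rho_{XA}\|\sigma_{XA}) - D(\rho_A\|\sigma_A) \\
&\ge D_{\mathbb{M}}\big(\rho_{XA} \,\big\|\, \mathcal{R}(\rho)\big) = \sum_{x\in X} \nu(x)\, D_{\mathbb{M}}\big(\rho^x \,\big\|\, \mathcal{R}(\rho)\big) . \qquad (119)
\end{aligned}$$

The second inequality in (117) is a consequence of the monotonicity of the quantum Rényi divergence in
the order parameter [32] and of the fact that for any two states there exists an optimal measurement
that does not increase their fidelity [14, Section 3.3]. □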
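As a sanity check on the structure of the remainder term, it can help to look at the fully classical (commuting) special case, where all states are probability vectors and the recovery map for the partial trace over $X$ reduces to a conditional-probability (Bayes) rule. In that case the joint convexity gap equals the relative entropy between the joint distribution and its recovered version. The Python sketch below verifies this numerically. The distributions, the variable names, and the restriction to the classical case are assumptions made for illustration only; the sketch does not implement the quantum recovery map of the corollary.

```python
import math


def relative_entropy(p, q):
    """Classical relative entropy D(p||q) in nats; assumes supp(p) lies in supp(q)."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Hypothetical ensemble: weights nu(x) and distributions rho^x, sigma^x on {0, 1}.
nu = [0.3, 0.7]
rho = [[0.6, 0.4], [0.1, 0.9]]
sigma = [[0.5, 0.5], [0.2, 0.8]]
labels, alphabet = range(len(nu)), range(2)

# Averaged distributions rho = sum_x nu(x) rho^x and sigma = sum_x nu(x) sigma^x.
rho_avg = [sum(nu[x] * rho[x][a] for x in labels) for a in alphabet]
sigma_avg = [sum(nu[x] * sigma[x][a] for x in labels) for a in alphabet]

# Joint convexity gap: sum_x nu(x) D(rho^x||sigma^x) - D(rho||sigma).
convexity_gap = (
    sum(nu[x] * relative_entropy(rho[x], sigma[x]) for x in labels)
    - relative_entropy(rho_avg, sigma_avg)
)

# Joint distribution rho_XA and its classical recovery from the marginal on A:
# R(rho)(x, a) = rho(a) * nu(x) * sigma^x(a) / sigma(a).
rho_joint = [nu[x] * rho[x][a] for x in labels for a in alphabet]
recovered = [rho_avg[a] * nu[x] * sigma[x][a] / sigma_avg[a]
             for x in labels for a in alphabet]
remainder = relative_entropy(rho_joint, recovered)
```

In this commuting example the gap and the remainder coincide, so the inequality holds with equality; the corollary states that a bound of the same shape survives in the general quantum case with the measured relative entropy.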
6 Approximate quantum error correction

Theorem 2.1 allows for establishing an alternate, information-theoretic set of conditions for approximate 
quantum error correction. Recall that in quantum error correction the main goal is to protect a set 
of states from the action of a noisy quantum channel, by encoding this set into a subspace of a given 
Hilbert space. The quantum error correction conditions given in the seminal works [4, 24] are necessary 
and sufficient for perfect quantum error correction. Although these works were essential for gaining an 
understanding of quantum error correction, the conditions given represent a strong idealization, since
we can never hope in practice to have perfect quantum error correction. This realization motivated
some follow-up works on approximate quantum error correction (see [3, 30] and references therein), in
which one seeks to determine conditions under which approximate recovery from the action of a given 
noisy channel is possible. We mention that some of these works made essential use of the Petz recovery 
map. 

In [17], information-theoretic conditions for perfect error correction were given in terms of the quantum
relative entropy. We recall their result: Let $A$ be a Hilbert space, let $C$ be a subspace of $A$ (called
the “codespace”), and let $\Pi$ denote the projection onto $C$. Then

$$\big\{ \forall \rho \in \mathrm{S}(C) : D(\rho\|\Pi) = D(\mathcal{N}(\rho)\|\mathcal{N}(\Pi)) \big\} \iff \big\{ \forall \rho \in \mathrm{S}(C) : \rho = (\mathcal{P}_{\Pi,\mathcal{N}} \circ \mathcal{N})(\rho) \big\} , \qquad (120)$$

where $\mathcal{P}_{\Pi,\mathcal{N}}(\cdot)$ is the Petz recovery map defined in (4). So perfect error correction is possible if and only
if for all $\rho \in \mathrm{S}(C)$ the pairwise distinguishability of $\rho$ with $\Pi$ does not decrease under the action of the
noisy channel $\mathcal{N}$.

Given the statement in (120) and the prior work on approximate quantum error correction, it is
natural to wonder whether a robust version of (120) exists. Indeed, Theorem 2.1 implies such a statement,
establishing necessary and sufficient information-theoretic conditions for approximate quantum
error correction. Loosely speaking, we can now say that approximate error correction is possible if and
only if the pairwise distinguishability of $\rho$ with $\Pi$ does not decrease by too much under the action of
the noisy channel $\mathcal{N}$ for all $\rho \in \mathrm{S}(C)$. The corollary below is a consequence of Theorem 2.1 and some
other known facts.

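Corollary 6.1. Let $\varepsilon \in [0,1]$, $A$ and $B$ be finite-dimensional Hilbert spaces, $C$ be a subspace of $A$
(called the “codespace”), $\mathcal{N} \in \mathrm{TPCP}(A, B)$ a quantum channel, and let $\Pi$ denote the projection onto
$C$. If for all $\rho \in \mathrm{S}(C)$ we have

$$D(\rho\|\Pi) - D(\mathcal{N}(\rho)\|\mathcal{N}(\Pi)) \le \varepsilon , \qquad (121)$$

then it is possible to approximately recover every state $\rho \in \mathrm{S}(C)$, in the sense that we have for all
$\rho \in \mathrm{S}(C)$

$$F\big(\rho, (\mathcal{R}_{\Pi,\mathcal{N}} \circ \mathcal{N})(\rho)\big) \ge 1 - \tfrac{1}{2}\varepsilon . \qquad (122)$$

Conversely, if (122) holds for $\varepsilon \in [0,1]$, then we have for all $\rho \in \mathrm{S}(C)$

$$D(\rho\|\Pi) - D(\mathcal{N}(\rho)\|\mathcal{N}(\Pi)) \le \sqrt{\varepsilon}\, \log(\dim C) + h_2(\sqrt{\varepsilon}) , \qquad (123)$$

where $h_2(p) := -p \log p - (1-p) \log(1-p)$ is the binary entropy, with the property that $\lim_{p \searrow 0} h_2(p) = 0$.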

Proof. The first statement is a direct consequence of Theorem 2.1, found by setting $\rho = \rho$, $\sigma = \Pi$, and
$\mathcal{N} = \mathcal{N}$ and then applying the inequality $-\log(x) \ge 1 - x$, which holds for $x \in [0,1]$.

To prove the second statement, suppose that (122) holds. By the Fuchs-van de Graaf inequality [13],
(122) implies that for all $\rho \in \mathrm{S}(C)$ we have

$$\tfrac{1}{2} \big\| \rho - (\mathcal{R}_{\Pi,\mathcal{N}} \circ \mathcal{N})(\rho) \big\|_1 \le \sqrt{\varepsilon} . \qquad (124)$$
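Then consider the following chain of inequalities, which holds for all $\rho \in \mathrm{S}(C)$:

$$\begin{aligned}
D(\rho\|\Pi) - D(\mathcal{N}(\rho)\|\mathcal{N}(\Pi)) &\le D(\rho\|\Pi) - D\big((\mathcal{R}_{\Pi,\mathcal{N}} \circ \mathcal{N})(\rho) \,\big\|\, (\mathcal{R}_{\Pi,\mathcal{N}} \circ \mathcal{N})(\Pi)\big) && (125) \\
&= D(\rho\|\Pi) - D\big((\mathcal{R}_{\Pi,\mathcal{N}} \circ \mathcal{N})(\rho) \,\big\|\, \Pi\big) && (126) \\
&= -H(\rho) + H\big((\mathcal{R}_{\Pi,\mathcal{N}} \circ \mathcal{N})(\rho)\big) + \mathrm{tr}\big([(\mathcal{R}_{\Pi,\mathcal{N}} \circ \mathcal{N})(\rho) - \rho] \log \Pi\big) && (127) \\
&\le \sqrt{\varepsilon}\, \log \dim(C) + h_2(\sqrt{\varepsilon}) . && (128)
\end{aligned}$$

The first inequality is a consequence of monotonicity of quantum relative entropy with respect to the
recovery map $\mathcal{R}_{\Pi,\mathcal{N}}$. The first equality follows because $(\mathcal{R}_{\Pi,\mathcal{N}} \circ \mathcal{N})(\Pi) = \Pi$. The second equality
is a rewriting, and the last inequality follows from the Fannes-Audenaert inequality [2] (see also [46,
Lemma 1]) and the facts that $\mathrm{supp}(\rho) \subseteq \mathrm{supp}(\Pi)$, $\mathrm{supp}((\mathcal{R}_{\Pi,\mathcal{N}} \circ \mathcal{N})(\rho)) \subseteq \mathrm{supp}(\Pi)$, and $\log \Pi = 0$ on
the support of $\Pi$. □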
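To get a feeling for the two bounds in Corollary 6.1, the following Python sketch evaluates the fidelity guarantee $1 - \varepsilon/2$ and the converse bound $\sqrt{\varepsilon}\log(\dim C) + h_2(\sqrt{\varepsilon})$ for given $\varepsilon$ and codespace dimension. The function names are hypothetical, and the use of the natural logarithm is an assumption; a different base only rescales the entropic quantities.

```python
import math


def binary_entropy(p):
    """Binary entropy h_2(p) in nats, with the convention 0*log(0) = 0."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie in [0, 1]")
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log(p) - (1.0 - p) * math.log(1.0 - p)


def fidelity_lower_bound(eps):
    """Guaranteed recovery fidelity 1 - eps/2 when the entropy decrease is at most eps."""
    if not 0.0 <= eps <= 1.0:
        raise ValueError("eps must lie in [0, 1]")
    return 1.0 - eps / 2.0


def converse_bound(eps, code_dim):
    """Bound sqrt(eps)*log(dim C) + h_2(sqrt(eps)) on the entropy decrease."""
    if not 0.0 <= eps <= 1.0:
        raise ValueError("eps must lie in [0, 1]")
    root = math.sqrt(eps)
    return root * math.log(code_dim) + binary_entropy(root)


# Example: a two-qubit codespace (dimension 4) and a small entropy decrease.
example_fidelity = fidelity_lower_bound(0.01)
example_converse = converse_bound(0.01, 4)
```

Note that the converse bound vanishes as $\varepsilon \to 0$ but scales like $\sqrt{\varepsilon}$ rather than $\varepsilon$, so the two directions of the corollary are not tight with respect to each other.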

7 Conclusion 

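In this work, we showed that for any non-negative operator $\sigma$ and for any channel $\mathcal{N}$ there exists an
explicit and universal recovery map $\mathcal{R}_{\sigma,\mathcal{N}}$ such that

$$D(\rho\|\sigma) - D(\mathcal{N}(\rho)\|\mathcal{N}(\sigma)) \ge -2 \log F\big(\rho, (\mathcal{R}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho)\big) \qquad (129)$$

for all density operators $\rho$ such that $\mathrm{supp}(\rho) \subseteq \mathrm{supp}(\sigma)$. A consequence of its universality is that the
recovery map satisfies $(\mathcal{R}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\sigma) = \sigma$. The present work thus constitutes an improvement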

over [45] since the recovery map is constructed explicitly and is independent of p. Knowing the explicit 
structure of the recovery map that satisfies (129) can be helpful in various scenarios. For example if 
this is to be implemented by experimentalists or in an actual quantum computer it is helpful to have 
an explicit recovery map. We showed that the inequality (129) implies universal and explicit remainder 
terms for other entropy inequalities such as strong subadditivity, concavity of the entropy, and the joint 
convexity of the relative entropy. Furthermore, it can be useful in the context of quantum sufficient 
statistics and approximate quantum error correction. 

Two questions remain unanswered. First, it is unknown whether (129) can be tightened in the sense that its
right-hand side can be replaced by a relative entropy term of the form $D(\rho\|(\mathcal{R}_{\sigma,\mathcal{N}} \circ \mathcal{N})(\rho))$. Second, it
is also unknown whether a bound in terms of a measured relative entropy (in the form of (11)) remains valid
for an explicit recovery map; recall that Theorem 2.2 proves the existence of a universal recovery map
that satisfies (11), but this recovery map is not explicit. Theorem 2.1, which shows an explicit recovery
map that satisfies (129), could therefore provide a new approach to attack these two open questions.


Appendix 

A Proof of Lemma 3.3 

We begin by recalling Hirschman's strengthening [18] of the Hadamard three-line theorem (see, e.g.,
[16, Lemma 1.3.8]):

Lemma A.1. Let $S := \{z \in \mathbb{C} : 0 \le \mathrm{Re}\{z\} \le 1\}$ and let $g(z)$ be holomorphic on the interior of $S$ and
continuous on the boundary, such that

$$\sup_{z\in S}\, \exp\{-a |\mathrm{Im}\{z\}|\} \log |g(z)| \le A < \infty , \qquad (130)$$

for some fixed $A$ and $a < \pi$. Then for $\theta \in (0,1)$, the following bound holds

$$\log |g(\theta)| \le \int_{\mathbb{R}} \mathrm{d}t\, \Big( \alpha_\theta(t) \log\big(|g(it)|^{1-\theta}\big) + \beta_\theta(t) \log\big(|g(1+it)|^{\theta}\big) \Big) , \qquad (131)$$

where $\alpha_\theta(t)$ and $\beta_\theta(t)$ are defined in (30).
We can now prove Lemma 3.3.


Proof of Lemma 3.3. The proof of this lemma is well known, but we provide it for completeness. For
fixed $\theta \in (0,1)$, let $q_\theta$ be the Hölder conjugate of $p_\theta$, defined by

$$\frac{1}{p_\theta} + \frac{1}{q_\theta} = 1 . \qquad (132)$$

Similarly, let $q_0$ and $q_1$ be Hölder conjugates of $p_0$ and $p_1$, respectively. We can find an operator $X$ such
that

$$\|X\|_{q_\theta} = 1 \quad \text{and} \quad \mathrm{tr}(X G(\theta)) = \|G(\theta)\|_{p_\theta} . \qquad (133)$$

We can write the singular value decomposition for $X$ in the form $X = U D^{1/q_\theta} V$ (implying $\mathrm{tr}(D) = 1$).
For $z \in S$, define

$$X(z) := U D^{\frac{1-z}{q_0} + \frac{z}{q_1}} V . \qquad (134)$$

As a consequence, $X(z)$ is holomorphic on the interior of $S$ and continuous on the boundary. Also, observe
that $X(\theta) = X$. Then the following function satisfies the requirements needed to apply Lemma A.1:

$$g(z) := \mathrm{tr}(X(z) G(z)) . \qquad (135)$$

Indeed, we have that

$$\log \|G(\theta)\|_{p_\theta} = \log |g(\theta)| \le \int_{\mathbb{R}} \mathrm{d}t\, \Big( \alpha_\theta(t) \log\big(|g(it)|^{1-\theta}\big) + \beta_\theta(t) \log\big(|g(1+it)|^{\theta}\big) \Big) . \qquad (136)$$

Now, from applying Hölder's inequality and the facts that $\|X(it)\|_{q_0} = 1 = \|X(1+it)\|_{q_1}$, we find that

$$|g(it)| = |\mathrm{tr}(X(it) G(it))| \le \|X(it)\|_{q_0} \|G(it)\|_{p_0} = \|G(it)\|_{p_0} \qquad (137)$$

and

$$|g(1+it)| = |\mathrm{tr}(X(1+it) G(1+it))| \le \|X(1+it)\|_{q_1} \|G(1+it)\|_{p_1} = \|G(1+it)\|_{p_1} . \qquad (138)$$

Bounding (136) from above using these inequalities then gives (29). □
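The observation $X(\theta) = X$ in the proof relies on the fact that Hölder conjugation is compatible with interpolation of the exponents. The Python sketch below checks this numerically under the assumption (not restated in this appendix) that the interpolated exponent satisfies $1/p_\theta = (1-\theta)/p_0 + \theta/p_1$, in which case $1/q_\theta = (1-\theta)/q_0 + \theta/q_1$ as well. The function names and the sample exponents are illustrative.

```python
import math


def holder_conjugate(p):
    """Return q with 1/p + 1/q = 1 for p in [1, inf]."""
    if p < 1:
        raise ValueError("p must be at least 1")
    if p == 1:
        return math.inf
    if math.isinf(p):
        return 1.0
    return p / (p - 1.0)


def interpolate_exponent(p0, p1, theta):
    """Return p_theta with 1/p_theta = (1 - theta)/p0 + theta/p1 (assumed relation)."""
    inverse = (1.0 - theta) / p0 + theta / p1
    return math.inf if inverse == 0.0 else 1.0 / inverse


# Sample exponents; any p0, p1 in [1, inf] and theta in (0, 1) can be used.
p0, p1, theta = 2.0, 4.0, 0.5
p_theta = interpolate_exponent(p0, p1, theta)

# Conjugating the interpolated exponent ...
q_theta = holder_conjugate(p_theta)
# ... agrees with interpolating the conjugate exponents.
q_from_endpoints = interpolate_exponent(
    holder_conjugate(p0), holder_conjugate(p1), theta
)
```

The two values agree because $1 - (1-\theta)/p_0 - \theta/p_1 = (1-\theta)(1 - 1/p_0) + \theta(1 - 1/p_1)$.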